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Abstract 


This  thesis  found  a  surveillance  plan  to  be  used  to  monitor  the  shelf-life  of  the  Air  Force’s 
chemical  defense  coveralls.  The  sampling  tests  are  destructive  and  the  objective  was  to  find  a 
surveillance  plan  that  would  minimize  the  number  of  suits  sampled  yet  provide  enough  information 
to  accurately  determine  if  the  chemical  defense  coveralls  had  degraded  past  a  minimum  acceptable 
level. 

Six  sampling  plans  were  developed  and  compared.  The  plans  included  the  Air  Force’s 
original  fixed  sampling  plan,  several  variations  of  sequential  sampling  plans,  a  Bayesian  sampling 
plan,  and  an  ad  hoc  pre-posturing  plan. 

Simulations  were  run  with  each  plan  under  various  degradation  functions  and  various 
values  for  the  test  variables.  It  was  determined  that  the  truncated  sequential  plan  showed  the  most 
promise  for  accurately  predicting  if  a  population  of  suits  should  be  accepted  or  rejected  using  a 


minimal  amount  of  suits. 


SURVEILLANCE  PLAN  FOR  MONITORING  THE  SHELF-LIFE 
OF  CHEMICAL  DEFENSE  COVERALLS 


1.  Introduction 


1.1  Statement  of  Problem 

A  cost  effective  surveillance  plan  is  needed  to  monitor  the  shelf-hfe  of  chemical  defense 
coveralls  in  the  United  States  Air  Force's  inventory.  The  surveillance  plan  should  detect 
deterioration  in  the  reliaWlity  of  the  chemical  defense  coveralls  and  provide  the  decision  makers 
with  the  knowledge  and  insight  needed  to  determine  the  coveralls'  shelf-hfe. 

1.2  Background 

The  CWU-66/P  is  a  chemical  defense  coverall  worn  as  an  outer  garment  by  Air  Force  flight 
personnel  for  protection  against  chemical  warfare.  The  suits  currently  have  a  projected  shelf-hfe 
often  years  and  cost  approximately  $375  each.  The  Air  Force  currently  has  15,848  of  these  suits 
in  inventory  that  were  manufactured  in  1990,  and  plans  to  purchase  40,000  more  suits  to  be 
manufactured  in  fiscal  year  1996.  The  Air  Force  also  has  21,980  units  of  an  older  version  of 
chemical  defense  coveraUs  in  inventory.  This  suit,  the  CWU-77/P,  was  also  manufactured  in 
1990.  In  addition  to  these  suits,  the  Department  of  Defense  plans  to  have  a  joint  service  chemical 
defense  suit  with  the  potential  of  one  milhon  suits  to  be  manufactured  and  purchased  by  the  four 
armed  services. 

The  newly  manufactured  CWU-66/P  wiU  be  manufactured  in  approximately  32  lots,  with 
each  lot  containing  an  average  of  1500  suits.  Arrangements  have  been  made  with  the  manufacturer 
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to  have  15  suits  randomly  pulled  from  each  production  lot  These  suits  will  then  be  reserved  for 
sampling.  In  addition,  60  suits  each  of  the  CWU-66/P  and  CWU-77/P  will  be  pulled  from  existing 
inventory  to  be  used  for  sampling.  A  graph  of  the  current  inventory  and  number  of  chemical 
defense  coveralls  set  aside  for  sampling  are  shown  in  Figures  1  and  2.  AH  suits  that  are  puUed  for 
sampling  will  be  sent  to  Natick  Research,  Development,  and  Engineering  Center  for  storage  and 
testing. 


40000 
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30000 
25000 
20000 
15000 
10000 
5000 
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CWU-  CWU-  CWU- 

66/Pinlnv.  66/P  to  77/P 

Buy 

Figure  1.  Inventory  of  Chem  Suits 


In  Inv.  to  Buy 

Figure  2.  Chem  Suits  for  Sampling 


Cost  of  the  testing  will  be  approximately  $625  per  suit.  The  suits  will  be  tested  for  simulant 
adsorption,  breaking  strength,  tearing  strength,  seam  strength,  water  adsorption  and  spray  rating. 
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The  testing  also  includes  a  visual  inspection  of  the  packaging  to  ensure  that  it  is  air  and  water  tight, 
and  an  adhesion  test,  both  of  which  are  pass/fail  visual  tests. 

The  current  sampling  plan  to  be  used  by  the  Air  Force  follows  the  sampling  plan  developed 
by  the  Army  to  monitor  their  own  chemical  defense  suits.  This  plan  divides  the  total  population  of 
stored  suits  into  three  groups  based  on  which  lot  they  came  from.  Three  suits  from  each  lot  will  be 
tested  in  year  zero.  This  will  give  a  baseline  for  the  test  results  in  the  later  years  to  be  compared 
to.  The  rest  of  the  suits  will  be  tested  annually  starting  in  year  five,  with  samples  being  drawn 
from  one  of  the  three  groups  of  suits.  Using  a  rolling  three  year  sampling  plan,  samples  from  each 
group  will  be  tested  every  third  year.  Table  1  shows  how  each  lot  is  placed  in  a  group  and  what 
years  the  lots’  samples  will  be  tested. 


Table  1.  Lot  Sampling  Plan 


Group 

Lots 

Years  Tested 

1 

1,4,7,...,28,31 

5th,  8th,  11th,  14th 

2 

2,5,8,...,29,32 

6th,  9th,  12th,  15th 

3 

3,6,9,...,27,30 

7th,  10th,  13th,  16th 

The  Army  delegated  the  authority  to  Natick  Research  to  decide  whether  to  accept  or  reject 
a  population  of  suits  based  upon  the  sampling  results.  The  results  of  the  individual  tests  on  each 
suit  wiU  be  available  to  the  Air  Force  for  their  own  analysis. 

1.3  Scope 

Ibis  research  wiU  attempt  to  extend  the  projected  10  year  shelf-life  and  increase  the  overaU 
reliability  of  the  chemical  defense  coveraU  by  focusing  on  the  surveUlance  plan  for  its  shelf-Ufe. 

By  minimizing  the  number  of  samples  needed  for  a  high  degree  of  confidence  in  the  reliability  of 
the  suit,  money  is  saved  in  the  sampling/testing  cost.  Sample  size  wiU  be  minimized  by  exploring 
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the  use  of  Bayesian  statistical  techniques  and  sequential  sampling.  If  the  results  of  this  sampling 
plan  indicate  the  reliability  of  the  suits  exceed  the  projected  10  year  shelf-life,  then  this  will  reduce 
the  life  cycle  costs  by  extending  the  life  of  the  suit. 

In  order  to  minimize  the  cost  of  the  surveillance  plan,  this  plan  will  take  into  consideration 
that  the  Air  Force  is  the  owner  of  the  suits,  and  therefore,  will  take  a  loss  from  either  a  Type  I  or 
Type  II  error  in  the  sampling.  That  is,  if  a  population  of  suits  is  accepted  when  it  should  have  been 
rejected,  or  rejected  when  it  should  have  been  accepted,  the  Air  Force  incurs  a  loss.  In  order  to 
reduce  the  computational  complexities,  it  is  assumed  that  testing  is  100%  accurate.  If  we  do  not 
assume  testing  is  100%  accurate,  we  would  need  to  compute  some  type  of  probability  distribution 
that  would  model  the  probabilities  of  how  far  the  tests  results  are  from  the  true  values,  and  the 
calculations  would  become  rather  unwieldy  for  the  scope  of  this  thesis. 

It  is  also  assumed  the  suits  will  degrade  over  time.  This  research  will  look  for  a  method  to 
connect  previous  years'  test  data  with  the  current  year's  testing  in  order  to  account  for  a 
degradation  in  the  reliability.  Degradation  in  the  reliability  of  protective  fabrics  is  known  to  be 
caused  by  oxidation  and  hydrolysis.  The  sponsor  stores  the  suits  that  are  to  be  sampled  in  an 
environmentally  controlled  warehouse,  therefore,  we  cannot  take  into  account  the  possibility  that 
storage  conditions  may  have  an  effect  on  the  suits’  reliability.  The  chemical  suits  are  not  stored 
under  controlled  conditions  in  the  field.  Future  sampling  plans  may  want  to  store  the  suits  in 
varied  conditions,  i.e.,  hot,  cold,  humid,  etc.,  or  draw  the  suits  randomly  from  the  field  and  take 
into  consideration  how  the  storage  conditions  may  effect  the  reliability  of  the  suits. 

This  surveillance  plan  is  intended  to  be  a  decision  tool  for  the  decision  maker  rather  than  a 
decision  maker  in  its  own  right.  That  is,  the  surveillance  plan  is  not  intended  to  explicitly  decide 
whether  to  reject  or  accept  a  population  of  suits.  Rather  it  wiU  be  a  tool  for  the  decision  maker  to 
use  to  help  him  make  a  decision  as  to  reject  or  accept  a  population  of  suits.  The  decision  maker 
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will  also  take  into  account  reliability  thresholds,  budgets,  and  the  Air  Force's  own  needs  in  his  final 
decision. 

1.4  Research  Approach 

In  order  to  derive  a  high  degree  of  confidence  in  the  reliability  of  the  suits  with  a  small  sample 
size,  an  approach  other  than  fixed  classical  statistical  sampling  is  needed.  Bayesian  statistical 
approaches  and  sequential  sampling  will  be  explored  in  order  to  increase  confidence  in  the 
reliability  of  the  suits  while  using  small  sample  sizes.  Bayesian  statistics  allows  the  use  of  prior 
data  and  expert  knowledge  in  determining  the  reliability  of  the  suits. 

Many  dilferent  approaches  incorporate  the  use  of  Bayesian  statistics  and  sequential  sampling. 
These  approaches  will  be  examined  and  analyzed  to  detemiine  which  approaches  will  work  best  for 
the  stated  problem.  Since  data  firom  the  Army’s  tests  on  their  chemical  suits  is  available,  one 
approach  may  be  to  incorporate  this  prior  data  into  the  Air  Force's  assessment  of  the  chemical 
defense  coverall's  reliability.  Another  approach  will  be  the  use  of  sequential  testing.  Sequential 
testing  allows  the  testing  to  be  terminated  earlier  in  the  sampling  if  it  is  clear  that  the  test  results 
will  result  in  accepting  or  rejecting  a  sample. 

With  the  completion  of  a  sampling  model,  computer  simulation  will  be  used  to  confirm  that 
the  surveillance  methodology  works.  Various  distributions  will  be  used  to  model  the  reliability 
degradation  of  the  suits  in  the  simulation  to  measure  the  robustness  of  the  proposed  methodology. 
The  simulation  will  ensure  that  the  model  detects  reliability  degradation  within  the  producer’s  and 
consiuner’s  risk  thresholds  that  are  given. 

After  all  the  sampling  methodologies  are  developed  and  the  output  of  the  simulations  are 
analyzed,  a  summary  and  recommendation  will  be  made.  We  will  recommend  which  sampling  plan 
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appears  to  woik  best  for  sampling  the  chemical  defense  suits  and  conclude  with  comments  that 
may  help  the  decision  maker  in  detennining  when  to  accept  or  reject  a  population  of  suits. 
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2.  Literature  Review 


2.1  Introduction 

This  literature  review  considers  current  concepts  used  in  sequential  sampling  and  Bayesian 
attribute  sampling  plans  that  can  be  applied  to  develop  a  surveillance  plan  to  monitor  the  shelf-life 
of  chemical  defense  coveralls. 


2.2  Definition  of  Surveillance  Testing 

AFOTECP  400-1,  describes  surveillance  testing  as  the  following: 

Surveillance  testing  generally  requires  that  a  number  of  preproduction  or  production 
systems  be  placed  in  actual  or  simulated  field  storage  conditions.  Periodically,  selected 
samples  of  these  assets  are  removed  from  storage  and  examined  for  degradation  from  original 
specifications.  The  surveillance  program’s  value  to  the  operational  tester  lies  in  the 
availability  of  similar  system  data  upon  which  to  base  a  comparability  analysis  when 
developing  an  early  system  reliability  prediction.  (AFOTECP  400-1, 1991  :A2-4) 


2.3  Plan  of  Development 

This  review  first  briefly  introduces  attribute  and  variable  sampling  followed  by  an 
introduction  to  the  development  and  concept  of  Bayesian  statistics.  With  sampling  and  Bayesian 
statistics  introduced,  concepts  in  the  use  of  Bayesian  statistics  in  attribute  sampling  follows. 

Having  a  background  of  the  concepts  needed  to  understand  the  applications  of  Bayesian 
statistics  in  attribute  sampling,  methods  are  given  for  determining  the  prior  distribution  of  the 
probability  parameter  p  in  a  binomial  distribution.  This  is  followed  by  a  review  of  the  applied 
methodologies  that  are  used  in  the  application  of  Bayesian  statistics  to  attribute  sampling. 

Midway  through  the  thesis  process,  it  was  found  that  the  test  results  would  not  be  reported  as 
attribute  data,  but  as  variable  data.  Therefore,  we  started  to  concentrate  on  variable  sampling 
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plans.  Sequential  sampling  using  variable  data  is  introduced  with  a  sequential  sampling  plan  given 
for  normally  distributed  test  items.  The  chapter  will  close  with  a  look  at  other  sampling  methods 
that  deal  with  the  problem  of  small  sample  sizes  and  destmctive  testing. 

2.4  Sampling 

Montgomery  states  that  the  purpose  of  sampling  is  to  sentence  lots  (accept  or  reject  them),  not 
to  estimate  lot  quality  (Montgomery,  1991:552).  Sampling  is  not  designed  to  estimate  the 
reliability  or  quality  of  the  lots.  Rather,  a  decision  is  made  whether  to  accept  or  reject  the  lot  based 
upon  the  results  of  samples  that  are  drawn  from  a  lot  and  tested.  Sheng  and  Fan  note  that  there  is 
no  mathematical  difference  between  using  quality  or  reliability  in  terms  of  defining  the  unknown 
characteristic  of  the  percentage  of  the  lot  that  is  good  (Sheng  and  Fan,  1992:307).  The  remainder 
of  this  paper  uses  quality  and  reliability  interchangeably. 

In  attribute  sampling,  the  results  of  the  tests  on  the  items  in  a  sample  are  recorded  as  either  a 
success  or  failure.  In  the  most  basic  attribute  sampling  plan,  the  single  sampling  plan,  the  decision 
to  accept  or  reject  a  lot  is  based  upon  the  results  of  only  one  sample.  The  producer  and  consumer, 
that  is  "the  party  submitting  the  product  for  acceptance  and  the  party  for  whom  the  decision  is 
made  regarding  acceptance  or  rejection"  (Grant  and  Leavenworth,  1988:402),  agree  upon  three 
numbers  before  any  sampling  is  done.  These  three  numbers  are:  N,  the  number  of  items  in  the  lot 
finm  which  the  sample  is  to  be  drawn;  n,  the  number  of  items  to  be  randomly  pulled  liom  the  lot; 
and  c,  the  maximum  number  of  failures  allowed  in  the  sample  in  order  to  accept  the  lot.  These 
numbers  are  usually  derived  through  classical  statistics  and  as  a  minimum,  are  functions  of:  lot 
size,  required  confidence  intervals,  required  quality  of  items,  and  the  a  and  P  risk,  which  are  now 

defined. 
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The  a  risk  is  defined  as  the  probability  of  rejecting  a  lot  when  the  quality  of  the  lot  is 
acceptable,  and  the  p  risk  is  defined  as  the  probability  of  accepting  a  lot  when  the  quality  of  the  lot 
is  not  acceptable.  The  a  and  P  risks  are  also  known  as  the  producer's  risk  and  consumer’s  risk 
respectively. 

In  variable  sampling,  the  actual  numerical  value  acquired  in  a  sampling  test  is  reported,  as 
opposed  to  the  pass/fail  value  given  in  attribute  sampling.  The  probability  distribution  of  the 
characteristic  being  tested  is  assumed  to  be  known.  There  are  many  different  variable  sampling 
plans  available.  For  variable  sampling,  the  focus  will  be  on  sequential  sampling  plans.  The  two 
main  advantages  of  variable  sampling  over  attribute  sampling  is  that  variable  data  usually  provides 
more  infonnation  on  the  characteristic  being  sampled  and  variable  sampling  requires  less  samples 
to  be  drawn  from  a  population  in  order  to  meet  the  a  and  p  requirements. 

2.5  Bayesian  Statistics 

Bayesian  statistics  allows  the  use  of  prior  knowledge,  either  in  the  fonn  of  past  data  or 
subjective  judgment  from  experts,  to  be  used  in  reliability  analysis.  Qassical  statistics  does  not 
allow  prior  knowledge  to  be  used,  so  in  a  sense,  it  is  lost  knowledge.  Two  important  practical 
benefits  are  gained  by  using  Bayesian  statistics  in  reliability  analysis.  The  first  is  the  increased 
quality  of  the  reliability  analysis  that  results  from  an  accurate  assessment  of  the  prior  knowledge. 
The  second  is  the  reduction  in  sample  size  that  usually  occurs  as  a  result  of  the  use  of  the  prior 
knowledge  (Martz  and  Waller,  1982:173). 

The  disadvantages  of  Bayesian  statistics  are  the  subjective  choices  of  the  prior  distribution 
and  the  parameters  for  the  prior  distribution.  There  are  of  course  no  mathematical  formulas  for 
subjective  judgments,  so  the  choice  of  a  prior  distribution  that  is  subjectively  but  thoughtfully 
chosen  is  always  open  for  argument  and  disagreement.  Along  with  the  choice  of  the  prior 
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distribution  is  the  subjective  choice  of  what  data  the  prior  is  based  on.  Often  prior  distributions  are 
based  upon  data  obtained  from  past  tests  on  systems  similar  to  the  system  for  which  the  prior  is 
being  constructed.  This  leads  again  to  disagreements  on  which  systems  can  be  assumed  to  be 
similar  to  the  new  system  being  modeled.  Also,  it  usually  caimot  be  shown  or  demonstrated  that 
the  prior  distribution  is  really  from  a  certain  probability  distribution  function.  The  prior 
distribution  is  often  chosen  so  that  it  is  mathematically  tractable. 

Bayesian  statistics  is  based  on  a  theorem  by  the  Reverend  Thomas  Bayes  who  presented  it 
in  the  eighteenth  century.  The  well-known  theorem  is  known  as  Bayes  theorem  and  is  as  follows: 


Pr(AIB)  = 


Pr(AnB) 

Pr(B) 


(1) 


In  Eq  (1),  both  sides  are  multiplied  by  Pr(B)  and  the  result  is 

Pr(AnB)  =  Pr(B)Pr(AIB)  (2) 


Rewrite  Eq  (2)  as 


Pr(AnB)  =  Pr(A)  Pr(  Bl  A  ) 


(3) 


and  replace  the  result  of  Eq  (3)  into  Eq  (1).  The  result  is 


Pr(AIB)=Pr(A) 


Pr(BI  A ) 
Pr(B) 


(4) 


where  Pr(A)  is  the  prior  probability  of  event  A  occurring  before  any  information  from  B  is 
obtained,  and  Pr(AIB)  is  the  posterior  probability  of  A  given  the  information  from  B  (Kapur  and 
Lamberson,  1977:368). 
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Eq(4)  is  used  to  apply  Bayes  theorem  to  probability  distributions.  Let  x  have  a  probability 
density  function  (pdf)/W,  dependent  on  a  parameter  0 .  In  classical  statistics,  0  is  assumed  to  be 
an  unknown  constant,  but  in  Bayesian  statistics,  0  is  considered  to  be  a  random  variable,  and  our 


belief  in  the  value  of  0  is  described  by  a  pdf,  say  h(  0 ). 

Assume  a  random  sample  of  n  items  is  drawn  from  a  population  whose  distribution  is  the  pdf 
f(x),  and  let  y  be  a  function  of  this  random  sample.  Then  we  have  a  conditional  pdf  giyj  0  j  for  y 
given  0 .  The  joint  pdf  for  y  and  0  is 

f(Q,y)  =  h(0)g(yl0)  (5) 

and  assuming  0  is  continuous 

f2(y)=jh(e)g(yiQ)de  (6) 

e 


which  is  the  marginal  pdf  for  y.  Then  using  Eq  (4),  the  conditional  pdf  for  0  given  the  information 
y  can  be  written  as 


^('0/yj  = 


h(B)g(ylQ) 

f2(y) 


fi  (y)  >  0 


(7) 


where 

miy)  = 

posterior  pdf  of  0  given  y 

h(B) 

prior  pdf  of  0 

giyIB)  = 

likelihood  of  y  given  the  value  of  0 

h  (y) 

marginal  pdf  fory 
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k(  Q I  y)  is  now  the  updated  pdf  for  0  given  the  new  information  y  (Kapur  and  Lamberson, 
1977:372).  Many  texts  on  Bayesian  statistics  refer  to  the  prior  pdf  and  posterior  pdf  simply  as  the 
prior  and  the  posterior.  This  thesis  will  follow  this  convention. 

In  Eq  (7),  it  is  /i(  0 ),  the  prior  pdf,  where  most  of  the  controversy  of  Bayesian  statistics  comes 
from.  The  prior  pdf  is  often  not  objectively  determined,  but  rather,  subjectively  determined.  If  the 
prior  could  be  determined  objectively,  the  rest  of  the  Bayesian  approach  falls  out  mathematically 
and  there  would  be  little  cause  for  controversy  in  the  Bayesian  approach.  However,  since  we  are 
using  experts'  educated  opinions  and  prior  sampling  data  that  may  or  may  not  correspond  nicely  to 
our  current  sampling,  this  subjectivity  will  remain  controversial. 

2.6  Using  Bayesian  Statistics  In  Attribute  Acceptance  Sampling 

The  most  important  and  controversial  decision  in  the  use  of  Bayesian  statistics  in  attribute 
sampling  is  the  choice  of  the  prior  distribution.  Not  only  does  it  need  to  be  decided  what  kind  of 
distribution  the  prior  pdf  should  take,  but  it  also  must  be  decided  what  values  are  to  be  assigned  to 
the  parameter's)  in  the  prior. 

A  common  approach  in  choosing  a  pdf  for  the  prior  distribution  is  to  choose  a  conjugate  prior 
distribution  for  the  given  sampling  distribution.  A  conjugate  prior  distribution  is  a  distribution  that 
causes  the  prior  and  posterior  distributions  to  always  be  of  the  same  family  of  distributions,  hi  this 
thesis,  the  sampling  distribution  is  the  binomial  distribution,  and  its  conjugate  prior  distribution  is 
the  beta  distribution.  No  matter  how  many  times  the  Bayesian  approach  is  used  on  the  binomial 
sampling  distribution  with  a  beta  prior,  the  posterior  distribution  will  always  be  of  the  beta  family. 
By  using  a  conjugate  prior,  the  resulting  posterior  distributions  are  always  mathematically 
tractable,  which  leads  to  the  criticism  that  the  conjugate  prior  is  chosen  purely  for  its  mathematical 
tractability. 
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While  this  criticism  is  true  for  those  who  those  who  blindly  choose  a  conjugate  solely  for 
tractability,  it  has  been  shown  that  for  the  binomial  sampling  distribution,  the  beta  family  is  the 
best  choice  for  the  prior  distribution,  in  that  it  is  robust  to  deviations  from  the  beta  distribution. 
Weiler  shows  that  if  one  uses  a  beta  prior  distribution,  when  the  true  prior  distribution  is  not  from 
the  beta  family,  that  the  impact  is  negligible  in  many  practical  applications  (Weiler,  1965:335- 
347).  Dyer  shows  that  for  three  different  conditions:  1),  when  there  is  no  available  prior  estimate 
for  p\  2),  when  there  is  only  a  point  estimate  for  p\  and  3),  when  there  is  an  interval  estimate  for 
p,  where  p  is  the  probability  of  success  in  the  binomial  distribution,  the  beta  pdf  is  the  best  choice 
compared  to  three  other  distributions.  Best  in  the  sense  that  the  posterior  distribution  resulting 
from  using  the  beta  prior  is  the  most  accurate  and  robust  in  all  three  given  conditions  (Dyer, 
1984:2051-2083).  Dyer  compared  the  beta  pdf  to  three  other  common  distributions  likely  to  be 
chosen  as  the  prior.  These  three  other  distributions  were  the  normal,  negative  log-gamma,  and 
kummer  distributions. 

Although  these  two  authors  give  good  arguments  for  using  a  standard  beta  for  a  prior,  Pham 
and  Turickan  make  the  point  that  a  general  beta  could  be  used  as  a  prior  instead.  They  argue  that 
in  practice,  the  reliability  of  an  item  is  usually  high  and  that  the  limits  of  the  values  for  the 
reliability  are  usually  known.  For  example,  the  reliability  of  the  chem  suits  may  be  known  to  be 
.75  <p  <  .99,  and  by  ignoring  these  boimds,  important  information  is  lost.  The  standard  beta 
takes  on  all  values  in  [0, 1],  but  the  general  beta  is  defined  on  the  closed  interval  \pl,  p2\,  where  0 
^p,  ,P2<1. 

Whereas  the  standard  beta  is  a  conjugate  to  the  binomial  distribution,  the  general  beta  is  not. 
In  their  paper,  Pham  and  Turkkan  give  attention  to  the  sensitivity  of  the  posterior  distribution  when 
using  a  standard  beta  prior  distribution  when  the  actual  prior  distribution  is  a  general  beta 
distribution  (Pham  and  Turickan,  1992:  310-316).  Using  a  standard  beta  for  the  prior  when  a 
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general  beta  is  more  appropriate,  they  show  the  sensitivity  of  the  posterior  distribution  is  not  all 
that  extreme  when  using  the  wrong  beta  distribution.  Considering  the  prior  is  often  subjective 
anyway,  there  is  no  great  loss  in  information  when  using  a  standard  beta  when  the  actual  prior  is  a 
general  beta. 

But  for  whatever  choice  is  made  for  the  prior,  the  analyst  must  be  able  to  give  good  reason 
and  justification  as  to  why  the  prior  was  chosen.  Once  a  choice  is  made  for  the  prior,  a 
preposterior  analysis  is  recommended  by  Martz  and  Waller  (Martz  and  Waller,  1982:187).  This 
procedure  analyzes  the  prior  distribution  before  any  sample  data  is  collected  by  noting  its  impact 
on  contradictory  and  confirming  hypothetical  data.  The  procedure  consists  of  four  steps: 

1 .  With  the  amount  of  sample  data  expected,  make  up  a  set  of  likely  and  unlikely  sample 
data. 

2.  Using  the  proposed  prior  distribution  and  each  set  of  hypothetical  sample  data, 
compute  the  posterior  distribution  using  Bayes'  theorem. 

3.  Study  the  posterior  distributions  noting  if  they  seem  reasonable  with  their  respective 
hypothetical  sample  data. 

4.  If  the  posterior  distributions  seem  reasonable,  the  proposed  prior  may  be  a  valid 
candidate  for  use.  If  they  are  not  reasonable,  adjust  the  prior  and  go  back  to  step  2. 

This  preposterior  analysis  will  help  in  validating  a  proposed  prior  distribution  before  any  sample 
data  is  collected. 

As  mentioned  earlier,  a  strong  justification  of  the  prior  distribution  must  be  documented  in 
order  to  have  a  credible  Bayesian  acceptance  sampling  analysis.  Along  with  documentation  on  the 
choice  of  the  prior  and  the  previously  given  preposterior  analysis,  a  clearly  defined  posterior 
distribution  on  the  parameter(s)  of  interest  and  an  analysis  of  the  sensitivity  of  the  Bayesian 
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inferences  to  the  prior,  ensures  a  well  defined  Bayesian  acceptance  sampling  analysis  (Martz  and 
Waller,  1982:189).  We  now  move  on  to  methods  that  have  been  developed  for  constructing  a  prior 
distribution  for  the  binomial  sampling  distribution. 

2.7  Methods  for  Constructing  a  Prior  Distribution 

Having  developed  a  foundation  for  Bayesian  statistics  in  acceptance  sampling,  an  examination 
and  review  of  the  applications  currently  used  for  constmcting  a  prior  is  warranted.  The  most 
commonly  used  and  easily  imderstood  applications  will  be  reviewed  first.  The  complexity  of  the 
appUcations  will  increase  as  they  are  introduced.  Most  of  the  applications  will  concentrate  on  a 
binomial  sampling  distribution  with  a  beta  prior. 

2 . 7. 7  Uniform  Prior. 

One  of  the  simplest  priors  for  the  binomial  sampling  distribution  is  the  uniform  distribution. 
The  uniform  prior  is  a  special  case  of  the  beta  distribution.  It  is  often  used  when  the  analyst  does 
not  have  any  prior  information  to  construct  a  more  informative  prior.  It  essentially  says  that  every 
value  of  p  (0  <  p  <  1 ),  which  is  the  probability  of  a  successful  test,  has  an  equal  chance  of  being 
true.  The  development  of  the  prior  is  shown  here  for  the  reader  who  is  not  familiar  with  Bayesian 
statistics.  The  posterior  distribution  for  a  binomial  sampling  distribution  with  a  uniform  prior  is 
derived  as  follows: 


hip) 

=  1 

The  pdf  for  the  uniform(0,l)  distribution. 

g(xlp) 

The  binomial  pdf  of  having  x  successes  in  n  trials. 

fiix) 

1 

=  j  g(xfp)h(p)dp 

0 


15 


=  ^(J)dp  The  marginal  pdf  for  X 


Placing  the  above  equations  into  Eq  (7)  the  posterior  is  obtained: 

\p-(i-pr-(i) 


k(plx)=  -J 


l[iy(i-pr^(i)dp 


(8) 


where 


p(x+l)-l(j  _  p  jn-x+iy-l 
J  p(^+iH(j  _  p  dp 


0  <p<l 


(9) 


\p(^+iH(j_p r(x+l)r(n  x+1)  ^  ^  ^ 
^  r(n+2) 


(10) 


Substituting  Eq  (10)  into  Eq  (9),  k(p  /  x)  is  a  beta  distribution  with  parameters  x+1  and  n-x+1, 
which  can  be  written  as  B(x+1,  n-x+1 ),  where  x  is  the  number  of  successful  trials  and  n  is  the  total 
number  of  trials.  The  expected  value  of  this  posterior  is  simply  the  expected  value  of  the  B(x+1,  n- 
x+1)  which  is  (x+1) !  (n+2). 

2.7.2  Expert’s  Subjective  Prior. 

In  this  method  of  defining  a  prior  distribution,  the  analyst  interviews  an  expert  of  the  system 
to  be  sampled.  The  expert  is  defined  as  someone  who  is  intimately  familiar  with  the  items  to  be 
sampled.  The  expert  gives  his  educated,  subjective  estimation  of  the  following  values  for  p,  the 
probability  of  a  successful  Bernoulli  test  from  an  item  taken  fi-om  a  lot  to  be  sampled: 

The  prior  mean  probability  PI,  where  PI  =  E[P], 
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The  prior  95th  percentile  P2,  where  Pr(P  >  P2)  =  0.05, 

The  prior  5th  percentile  P3,  where  Pr(P  <  P3)  =  0.05 

That  is,  before  the  sampling  is  performed,  there  is  only  a  5%  chance  that  the  value  p  will  exceed 
P2  and  only  a  5%  chance  that  the  value  of  P  will  be  less  than  P3  (Martz  and  Waller,  1982:236). 

This  method  provides  a  means  for  translating  information  about  percentiles  and  means  into  a 
beta  prior  distribution.  By  picking  any  combination  of  pairs  of  fiom  PI,  P2  or  P3,  the  analyst  is 
able  to  get  unique  values  for  the  parameters  xo  and  no  to  be  used  in  the  beta  prior  distribution, 

B(xo ,  no  ).  The  values  are  obtained  from  a  table  that  was  constracted  using  numerical  methods. 
The  table  simply  gives  the  unique  beta  distribution  that  corresponds  to  the  pair  of  percentiles  and 
mean  given  by  the  expert.  The  posterior  distribution,  given  n  trials  with  x  successes,  can  then  be 
constructed  from  Eq  (7).  With  a  prior  distribution  of  B(xo ,  no  ),  the  posterior  distribution  is  then 
B(xo  +  x,no  +  n-  x),  where  n  is  the  number  of  trials  and  x  is  the  number  of  successful  trials. 

2.7.3  Method  of  Moments. 

The  method  of  moments  allows  a  beta  prior  to  be  formed  based  on  previously  observed 
sample  data  from  similar  past  experiments  on  similar  items.  Assume  a  sequence  of  N  sets  of 
binnmial  sampling  tests  were  run,  where  for  each  set  of  tests,  there  were  Xj  successes  out  of  a 
sample  size  «t/ ,  for  j  =  Ito  N,  and  pj  unknown.  Also  assume  that  the  sample  sizes  do  not  vary  too 
greatly  from  test  to  test. 

Martz  and  Waller  show  that  the  sample  mean  for  the  probability  of  a  successful  trial,  , 

and  second  sample  moment,  ,  about  the  origin  of  the  sequence  pj ...  p;^  are  defined  as: 


N  N  ^2 

IT-  X'  »,2  -XEl. 

Pu  ^  ^  u  ^  ^ 
J=1 


(11) 
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where  pj  is  defined  as  xj  I  nj.  By  the  use  of  the  method  of  moments  procedure  (Mann,  Schafer 

and  Singpurwalla,  1974:46-55),  equate  the  sample  moments  to  their  expected  values,  and  solve  for 
xo  and  rto .  the  parameters  for  the  beta  prior.  Using  this  method,  xo  and«o  are  found  by  solving 
the  following  equations: 


NmI-KP^-(N-K)P^  ’ 


N 
i=i 


-1 


(12) 


and 


Xo  =  fioPu 


(13) 


In  the  case  where  the  difference  in  the  sample  sizes  of  the  tests  are  severe,  weights  are  given  to  the 
sample  moments  (Martz  and  Waller,  1982:312-315). 

2 . 7.4  Marginal  Maximum  Likelihood. 

Like  the  method  of  moments  just  presented,  the  marginal  maximum  likelihood  also  allows  a 
beta  prior  to  be  formed  based  on  previously  observed  sample  data  from  similar  past  experiments  on 
similar  items  (Martz  and  Waller,  1982:316-318).  This  method  finds  the  parameters  of  the  beta 
prior  by  maximizing  the  marginal  likelihood  function.  The  marginal  distribution  of  Xj ,  where 
j=l..N, is 


f(Xj,xono)  = 


Hj  !r(no  )r( Xj  +xo)T( Hj  +no-Xj-xo) 
Xj  !( nj  -  Xj  )!T ( nj  +no)T( xq  )T(no-xo  ) 


(14) 


and  the  marginal  likelihood  is  just  the  product  of  these,  for  J  =  1..N.  The  marginal  likelihood  is 
then  maximized  by  finding  the  appropriate  values  for  xo  and  «o .  By  taking  the  log  of  the 
likelihood  and  differentiating  with  respect  to  xo  and  no,  the  following  equations  are  found: 


18 


a/nL 


rij  >  Xj  >1 


(15) 


N  f  1  \  N  "j 

=y  y  — — y  y 

0  jwS(^»+‘V  ;W  5  V"o-=‘o*‘ 


m-Xi-l , 


dxi 


dlnL 
dn. 


N 


0  §  §  Kno-xo+i)  p^yno-+i) 


N 

-IS 


,  Itj  >  Xj 


(16) 


Note  in  Eq  (15),  ifxj  =0  or  xj  =  rij ,  then  the  corresponding  summation  on  i  is  defined  to  be  zero. 

Equating  these  equations  to  zero  and  solving  simultaneously,  the  maximum  likelihood 
estimates  for  xq  and  no  are  obtained.  These  equations  must  be  solved  numerically,  and  starting 
values  are  taken  from  the  method  of  moments  estimates,  xo  and  no  are  then  the  parameter 
estimates  for  the  beta  prior. 

2 . 7.5  Empirical  Bayes  Single  Sampling  Plans. 

This  method  considers  a  Bayesian  attribute  sampling  plan  without  explicitly  defining  a  prior 
distribution  (Martz,  1975:652-653).  It  derives  its  prior  from  past  data  from  similar  lots  of  items 
that  were  sampled. 

Reasons  to  use  the  empirical  methodology  instead  of  the  method  of  moments  and  the  marginal 
maximum  likelihood,  is  that  the  assumption  of  a  prior  is  sometimes  undesirable  when  the  lot 
fraction  defective  is  believed  to  be  concentrated  over  a  range  much  smaller  than  [0, 1]  (Pham, 
1990:7).  Another  reason  for  not  explicitly  defining  a  prior  is  that  sometimes  quality  control 
personnel  are  reluctant  to  assume  a  specific  prior  that  cannot  be  easily  verified. 

Consider  m  lots  having  been  sampled,  where  for  lots  j  =  I  to  m,  the  following  are  defined. 
Note  that  before,  p  was  defined  as  the  fraction  of  the  lot  not  defective.  We  will  now  uses  q  as  the 
fraction  defective  in  the  lot: 


fraction  defective  in  jth  lot 
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Nj  lot  size  of  jth  lot 

Kj  sample  size  of  jth  lot 

Xj  number  of  failures  in  jth  sample 

yj  Xj  /  nj 

y  (yi  ,y2  ,-.,ym ) 

Martz  develops  a  method  to  estimate  the  prior  lot  fraction  defective  density  function^^),  based 
upon  a  class  of  kernel  estimators.  f(q)  is  given  as  the  following: 


where 


and 


f(<ily)  = 


— - — X*/. 


<i-yj 


0<q<l 


h-h(m)  =  \ 


m 


H 


J/5 


I  n„ 

I  j=l 


Sm>0 


Sm=0 


(17) 


(18) 


kj(t)  =  \ 


wjnj(l-\t\) 

2wjnjh^(l-\t\)/(2h^  -(yj -hf ) 

2wjnjh^(l-\t\)/ (2h^  -(l-yj -hf )  {\t\<l,yj+h>l} 


{\t\<l,yj-h>0,yj+h<l} 
{\^<l,yj-h<0}  (19) 


In  Eq  ( 1 8),  8  y  =  1  or  0  according  to  whether  wj  >  0  or  wj  =  0,  respectively,  and 

Sm  =  '^Wjnj(y j-yf  I  N  (20) 


where 


y='L^j^jyj '  ^ 


(21) 
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and 


N  =  '^Wjnj 


(22) 


In  this  equaticHi,  wj  is  a  non-negative  weight  that  is  applied  to  the  data  from  the  jth  lot.  This 
weight  is  determined  by  the  analyst,  with  the  idea  being  recent  lots  should  be  weighted  more 
heavily  than  distant  past  lots.  Also  weights  of  zero  could  be  assigned  to  those  lots  that  are  not  to 
be  included  in  the  computation  of  the  prior.  The  constant  IT  in  Eq  (1 8)  should  be  set  to  1  unless 
the  prior  is  believed  to  have  many  local  maxis  and  mins  with  sharp  peaks.  For  the  remainder  of 
this  methodology,  W  and  wj  will  be  set  to  one. 

By  the  use  of  Bayes  theorem,  Martz  came  up  with  a  posterior  probability  for  Pr(Q<ql  x). 
This  equation  is 


Pr(Q<qlx,y) 


lf(xlq)f(qly)dq 

<0 _ 

Pr(X=xfy) 

p 


where 


1 

Pr(X  =  xly)  =  \f(xlq)f(qly)dq 
0 


{Pr(X=xly)^0} 

(23) 

{Pr(X=xly)=0} 


(24) 


Martz  develops  the  equations  for  the  numerator  and  denominator  for  Eq  (23),  which  are  quite 
lengthy  and  complex. 

Martz  then  tested  his  empirical  Bayes  sampling  plan  with  the  Bayes  sampling  plan  where  a 
beta  prior  distribution  was  assumed,  by  using  Monte  Carlo  simulation.  For  unimodal  symmetric  or 
moderately  skewed  priors,  the  Empirical  Bayes  and  beta-prior  Bayes  were  in  general  agreement. 
For  cases  with  highly  skewed  (such  as  U-shaped)  priors  however,  the  Empirical  Bayes 
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methodology  yielded  an  improvement  over  the  beta-prior  Bayes  method.  “The  results  of  the  test 
indicated  that  the  Empirical  Bayes  estimators  quite  satisfactorily  approximated  the  corresponding 
Bayes  estimators”  (Martz,  1975:657). 

Martz  shows  how  to  find  the  optimal  single-sampling  plan  using  Empirical  Bayes  that  attains 
the  specified  posterior  consumer’s  and  posterior  producer’s  risks.  He  defines: 

Posterior  Consumer’s  Risk  =  p  (n,  c)  =  Pr(Q  >  q2 1  lot  accepted,  y) 

Posterior  Producer’s  Risk  =  a  (n,  c)  =  Pr(Q  <  qi  /  lot  rejected,  y) 

where  q2  is  a  value  of  q  specified  by  the  consumer  that  requires  a  high  probability  that  Q  does  not 
exceed  this  value  in  accepted  lots,  and  q\  is  a  value  of  q  specified  by  the  producer  that  requires  a 
low  probability  that  Q  does  not  exceed  this  value  in  rejected  lots.  Recall  that  Q  is  defined  as  the 
fraction  defective.  The  sampling  plan  should  then  satisfy: 

fi(n.c)  <  p  (P  SmaU) 
a(n,c)  <  a  (a Small) 

where  p  and  a  are  the  maximum  values  of  the  specified  risks  declared  by  the  consumer  and 
producer.  The  optimal  sampling  plan  wUl  then  be  the  plan  that  satisfies  the  two  inequalities  and 
has  the  smallest  value  of  n.  The  methods  of  finding  the  optimal  values  of  n  and  c  are  the  same  as 
those  in  Wood’s  method  (Wood,  1983)  that  is  presented  later  in  this  thesis. 

Martz  concludes  that  the  Empirical  Bayes  method  can  be  used  to  achieve  specified  posterior 
consumer  and  producer  risks.  In  general  the  sample  sizes  under  this  Bayes  method  will  be  smaller 
than  those  in  classical  statistics,  especially  when  the  previous  lots  that  were  observed  were  of 
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excellent  quality.  The  main  advantage  of  Empirical  Bayes  is  that  the  prior  does  not  have  to  be 
explicitly  assumed. 

2.8  Bayesian  Acceptance  Sampling  Plans  for  Binomial  Distributions 

This  section  reviews  how  Bayesian  statistics  are  applied  to  acceptance  sampling  plans  using 
attribute  data.  These  two  plans  are  based  upon  typical  attribute  acceptance  plans,  where  the 
number  of  items  sampled  is  minimized  with  respect  to  the  constraints  of  the  consumer’s  and 
producer’s  risks. 

2.8.1  Bayesian  Reliability  Test  Plans  for  One-Shot  Devices. 

This  approach  develops  Bayesian  attribute  acceptance  sampling  plans  for  one-shot  devices 
(Wood,  1983:1-18).  A  one-shot  device  is  an  item  that  is  destroyed  once  it  is  used  or  tested. 
Examples  of  one-shot  devices  are  missiles  and  bombs  and  in  our  case,  can  be  extended  to  chemical 
protective  coveralls.  The  plan  uses  data  from  similar  testing  attained  previously  to  construct  a 
prior.  The  previous  test  data  will  be  referred  to  as  (No  ,  ko  \  where  No  is  the  number  of  trials  and 
ko  is  the  number  of  failures  that  occurred. 

Before  any  testing  begins,  the  consumer  and  producer  agree  on  the  following  values: 

=  maximum  tolerable  value  for  producer’s  posterior  risk 
3  ^  =  maximum  tolerable  value  for  consumer’ s  posterior  risk 

=  reliability  threshold  for  a  calculation 
rp  =  reliability  threshold  for  p  calculation. 

Once  these  values  are  agreed  upon,  they  are  used  with  the  prior  distribution  to  determine  the 
sample  size  for  testing  (nj )  and  to  determine  the  maximum  number  of  failures  (M)  allowed  in 
order  to  accept  the  lot.  After  testing  is  completed,  the  actual  values  for  the  producer’s  and 


23 


consumer’s  risk,  a  and  p ,  are  calculated.  In  this  methodology,  a  is  defined  as  Pr(r>rjreject), 
and  p  is  defined  as  Pr(  r  <  r^l accept ) ,  where  r  is  the  actual  reliability  of  the  lot.  Note  that  earlier 

articles  referenced  in  this  section  used  p  rather  than  r  as  the  reliability  or  probability  an  item  would 
be  successful. 

Once  the  producer  and  consumer  agree  upon  the  values  given  above,  the  prior  distribution  is 
developed.  This  is  accomplished  by  assuming  there  was  a  uniform  prior  distribution  before  the 
event  (No ,  h  )  took  place.  This  uniform  prior  is  then  updated  by  the  event  (No ,  )  in  the  usual 

Bayesian  fashion  and  this  new  posterior  is  now  the  prior  for  the  present  testing.  The  uniform  is  a 
special  case  of  the  beta  distribution,  and  so  is  conjugate  to  the  binomial  sampling  distribution. 

More  specifically,  the  prior  is  now  a  beta  distribution  with  parameters  (No  -ko  +  1,  h  +1),  and 
every  posterior  and  prior  after  this  will  be  also  be  beta.  Recall  that  in  tiiis  method,  ko  represents 
the  number  of  failures,  not  the  number  of  successes. 

The  event  (Ni  ,ki)ts  defined  as  the  results  of  the  new  sampling  test  conducted.  Also  define 
M  as  the  maximum  number  of  failures  allowed  in  the  sample  to  accept  the  lot.  The  goal  now  is  to 
find  an  equation  for  a ,  the  actual  producer’s  risk. 

Recall 

1 

a  =  Pr(  r  >  reject  j  =  J  /f  r/ reject )dr  (25) 


and  Bayes’  Theorem,  Eq  (7),  can  be  applied  to  show 

Pr(rejectlr)f(r) 


f  (rj  reject  )=■ 


Pr(  reject ) 


(26) 


24 


where  f(r)  is  just  the  prior  distribution  based  on  the  data  (No ,  h ),  and  Pr(reject  /  r)  =  Pr(k]  > 
M/r),  is  the  probability  that  more  than  M  failures  occurred  in  the  test  with  A^i  items  tested.  This 
latter  term  can  be  written  using  the  binomial  sampling  distribution  as 

Pr(  reject!  r  )=  X  f  ^  (27) 

Ihe  denominator  to  Eq  (26)  is  found  by  conditioning  on  r.  That  is 

1 

Pr(  reject )  =  J  Pr(  rejectjr)f(  r  )  (28) 

0 

and  then  substitute  the  prior  distribution  f(r),  which  is  a  B(No-lco+l,  ko+1),  and  Eq  (27)  into 
Eq(28). 

By  substituting  Eqs  (27)  and  (28)  and  the  prior  f(r)  back  into  Eq  (26)  and  integrating,  the 
producer’s  risk  can  be  found  (Wood,  1983).  The  result  is  quite  lengthy,  but  it  is  in  closed  form.  It 
is  a  function  of //(}  ,ko  rr^.Ni  ,ki  ,andM.  It  is  found  that  the  producer’s  risk  is  an  increasing 

function  of  Ni  and  a  decreasing  function  of  M. 

The  computation  of  the  consumer’s  risk  p  is  accomplished  in  the  same  way  as  the  producer’s 

risk.  Where  we  can  write 

^  =  Pr(r<r^l  accept  )=  J  firj  accept  )dr  =  j  (29) 

By  calculating  this,  it  turns  out  that  P  is  a  function  ofNo  ,ko  ,  ,Ni  ,ki  ,  and  M,  with  p 
being  a  decreasing  function  of  Ni  and  an  increasing  function  of  M. 
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Now  with  the  actual  risks  a  and  (3  defined,  the  number  of  samples  and  maximum  failures 
allowed  for  acceptance,  Ni  and  M  respectively,  are  solved  such  that  ci.<a^  and  p  ^  p  .  The 
algorithm  to  solve  for  Ni  and  M  starts  by  setting  M  =  0  and  increasing  Ni  (to  reduce  value  of  p ) 
until  P  ^  P;„ .  Now  at  this  point  (Ni ,  M),  it  is  determined  ifa<a^.  Ifa<a^,  then  (Ni ,  M)  is 
the  solution.  Otherwise,  M  is  increased  (to  reduce  value  of  a )  until  a  <  and  then  the 
algorithm  is  started  over.  This  iterative  scheme  wiU  result  in  the  smallest  Ni  with  the  smallest  M, 
such  that  a  <a^  and  p  <  p^. 

The  sampling  plan  is  then  to  take  a  sample  of  size  Ni  and  accept  the  lot  if  the  number  of 
failures  it;  <  M  and  reject  the  lot  if  kj  >  M.  Once  testing  is  accomplished  and  ki  is  known,  the 
actual  risks  can  be  determined  and  a  new  prior  can  be  updated  for  the  next  round  of  sampling  if 
needed. 

2.8.2  Combined  Bayesian,  Sample  Theoretic  Approach. 

Launer  and  Singpurwalla  give  a  Bayesian  approach  that  is  used  to  monitor  the  decay  in  the 
reliability  of  an  arsenal  (Launer  and  Singpurwalla,  1986).  The  method  works  on  the  premise  that 
testing  is  expensive  and  destructive,  so  there  is  a  strong  desire  to  minimize  the  sample  size. 

Since  reliability  changes  over  time,  sampling  is  done  on  a  time-interval  basis,  i.e.,  annually. 
They  define:  Wt .  to  be  the  number  of  items  to  be  tested  in  time  period  t;  Xt ,  the  number  of  items 
tested  successfully  in  time  t;  and  pt ,  the  probability  the  item  tested  in  time  t  results  in  a  success. 
The  results  are  given  as  pass  /  failure,  so  we  use  a  binomial  distribution,  pt  is  a  random  variable 
given  a  prior  density  function,  g(p),  and  «t  is  a  decision  variable  that  needs  to  be  minimized.  The 
equations  used  to  solve  for  «t  are  derived  as  follows. 
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When  pt  is  large,  intuitively  the  number  of  failures  in  a  sample  of  size  n,  would  be  expected 
to  be  small.  Given  rh  and  pt ,  let  Xt*  be  the  largest  integer  such  that  the  chance  of  observing  Xt*  or 
fewer  successes  is  small,  say  a .  Then  we  have 


Pr{x,*  or  fewer  successes  in  rk  tests  I  pi) 


Pid-Pt 


<a. 


(30) 


Then  if  p,  changes  to  pt  -  A ,  with  A  large,  then  intuitively,  the  number  of  failures  in  a  sample 
would  be  large,  and  if  A  is  small,  the  number  of  failures  in  a  sample  would  still  tend  to  be  small. 
A  change  of  A  should  be  detected  with  probability  it,  where  A  andn  are  determined  by  the 
decision  makers.  Let  the  consumers  risk  B  =  1  -  n,  then 


Pr{x,*  or  fewer  successes  in  «t  tests  I  pt  -  A } 


>1-B 


(31) 


In  Eqs  (30)  and  (3 1),  p, ,  a,  B ,  and  A  are  known,  and  iti  and  x,*  need  to  be  obtained  by  solving  Eqs 
(30)  and  (31)  simultaneously.  Eq  (30)  is  then  used  to  determine  if  the  reliability  of  the  items  being 
sampled  at  time  t  is  still  pt ,  with  a  Type  I  error  a .  If  Xt  >  Xt*,  we  assume  the  reliability  of  the 
items  has  not  changed. 

Since  we  do  not  know  the  value  of  pt,  a  prior  distribution  is  assigned  to  it  as  mentioned 
previously.  Since  pt  is  a  random  variable  with  a  distribution  g(p, ),  Eqs  (30)  and  (31)  must  be 
conditioned  on  pt.  These  equations  then  become 


(32) 


27 


and 


JE  ”;  \pt-^)Ul-Pt  +Ar‘-^g(Pt)dPt  ^1-B  (33) 

0  j=0^  •'  ^ 

where  these  equations  need  to  be  solved  simultaneously  to  obtain  rit  and  Xt*. 

To  implement  this  procedure,  we  define  a  prior  distribution  and  then  sample  a  population  of 
items.  If  the  sample  indicates  the  population’s  reliability  did  not  decay,  (i.e.  Xt  >  Xi*)  the  prior  is 
updated  with  the  sample  results.  If  the  sample  indicates  that  the  population’s  reliability  did  decay 
(i.e.  jct  <  Xt*),  then  the  population  is  rejected. 

As  an  alternative  approach,  it  is  suggested  to  replace  pt  in  Eqs  (30)  and  (31)  by  the  modal 
value  of  g(pi)-  The  modal  value  is  the  most  likely  value  of  pt ,  a  single  number.  This  would  make 
the  calculations  easier  and  the  number  of  items  to  be  sampled  would  be  less,  since  all  the 
probabilities  are  concentrated  on  a  single  value.  The  paper  does  not  give  any  indication  though,  as 
to  the  sensitivity  of  replacing  the  prior  distribution  of  pt  by  its  modal  value. 

In  a  paper  prior  to  this  one,  a  sequential  sampling  technique  is  introduced  by  the  same  authors 
that  uses  the  same  equations  (Singpurwalla  and  Launer,  1984:1-17).  However  this  time,  testing  is 
done  one  item  at  a  tune  and  after  each  item  is  tested,  it  is  determined  whether  to  accept  or  reject  the 
population,  or  continue  testing  more  items.  The  maximum  number  of  items  tested  would  be  no 
larger  than  nt  determined  by  solving  Eqs  (32)  and  (33).  With  this  sequential  sampUng  scheme,  the 
expected  number  of  items  to  be  tested  E[n,  I  pj  can  be  determined. 

Given  rit  and  Xt*,  the  probability  that  n,  =  x  is  determined  by 
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Print  =x/pt)  = 


x-1  ^ 


I J 

;<  j)^ - f Pf‘*‘ ■’‘Uxin,. 


(34) 


Again,  pt  is  replaced  by  the  prior  distribution  g(p, )  which  is  p(Y,5) ,  and  integrated  with  respect  to 
Pt  to  obtain 


Print  =x)- 


x—1 


r('Y+6)  Tix-nt+x*  +y )Tint-Xt  +d) 


[nt  -xt  -l)Tiy)ri8)  rfY+5+;c) 

x-1  ^  r('Y+5)  Tjx-nt+x* +y )rint-x* +5)  ^ 

nt-x*-l)Tiy)Ti5)  rfY+S+Jc) 

x-1  ^  FCy+B)  Tix*  +l+y )rix-x*t  -1+5) 


»  * 
-Xt  <X<Xi 


(35) 


.X-Xt 


IJnyJTid) 


r(Y+s+jc) 


Xt  <x<nt. 


Then  E[nt  ]  can  easily  be  determined  by  summing  Print  =  x)ix)  for  all  values  of  x  from  x  =  Xt*+  1 


to  X  =  n,.  . 

Provided  that  those  testing  the  samples  have  the  capability  of  testing  sequentially,  a  savings  in 
the  number  of  items  tested  is  recognized. 


2.9  Sequential  Sampling. 

Whereas  the  Bayesian  methods  dealt  with  attribute  sampling,  the  Sequential  methods  will  deal 
with  variable  sampling.  As  mentioned  at  the  beginning  of  this  chapter,  it  was  discovered  halfway 
throughout  the  thesis  project  that  we  would  be  given  variable  data  rather  than  attribute  data. 

In  sequential  sampling,  the  sample  size  is  not  fixed  as  in  single  sampling  plans,  but  rather  it  is 
a  random  variable.  Sequential  sampling  allows  the  possibility  of  a  reduced  sample  without  the 
consumer’s  or  producer’s  risks  increasing.  This  is  accomplished  by  testing  one  unit  from  a  sample 
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at  a  time,  and  immediately  detennining  if  enough  information  is  available  to  either  accept  or  reject 
a  population.  After  a  test  of  an  item,  one  of  three  possible  actions  may  occur:  (1)  there  is  enough 
information  to  accept  the  lot;  (2)  there  is  enough  information  to  reject  the  lot;  or  (3)  continue  the 
testing  since  enough  information  has  not  been  acquired  to  either  accept  or  reject  the  lot. 

With  sequential  sampling,  the  test  will  always  tenninate  and  will  seldom  use  more  sample 
units  for  testing  than  is  required  with  single  sampling  methods.  The  expected  number  of  units  to  be 
tested  in  sequential  sampling,  E(N),  will  be  smaller  than  the  fixed  sample  size  in  single-sampling 
methods.  Since  it  has  been  determined  that  we  will  have  variable  test  data  from  the  results  of  the 
sampling,  we  will  focus  on  sequential  sampling  that  uses  variable  data  as  opposed  to  attribute  data. 
For  a  more  complete  coverage  of  sequential  sampling  and  the  theory  behind  it,  see  (Mann,  Schafer 
and  SingpurwaUa,  1974)  or  (Kapur  and  Lamberson,  1977). 

2.9.1  Sequential  Sampling  from  a  Normal  Distribution. 

Assuming  the  variable  test  results  of  the  chemical  suits  follow  a  normal  distribution,  a 
sequential  sampling  plan  can  easily  be  implemented.  We  will  test  the  hypothesis  that  the  trae  mean 
of  the  suits  reUabiUty  U  is  greater  than  some  value  Uq.  (/i  is  defined  as  the  value  at  which  the 
analyst  wishes  only  a  p  probability  of  erroneously  accepting  the  test  hypothesis  when  U  =Ui  . 
Using  the  conventional  definitions  of  a  and  P ,  we  come  up  with  the  following  sequential  sampling 
plan: 


Ho:  U  >Uo  (Meets  requirement) 

Hi:  U  <Uo  (Does  not  meet  requirement) 

Given  s  -  (36) 

2 
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(37) 


m 

1)  AcceptHo:  U>Uo,  if^X;  >ho  +ms 

i=l 


where  ho  = 


Uj-Uo 


In 


P 

1-a 


(38) 


m 

2)  RejectHo:  U>Uo,  <hj  +ms  (39) 

i=l 

2 

where  hj=  — - In  - — ^  (40) 

U 2  ^Uq  CC 

m 

3)  Take  an  additional  observation  if  hj  +  ms  x,-  <ho+ms  (41) 

i=i 

where  m  is  the  niraiber  of  items  sampled  and  Xi  is  the  value  of  the  ith  sample  (Mace,  1973: 
132-135). 

2.10  Other  Approaches  to  Binomial  Acceptance  Sampling  with  Destructive  T esting 

While  all  of  the  above  methods  utilized  Bayesian  or  sequential  sampling  methods  in  their 
approaches,  two  methods  were  found  that  dealt  with  small  sample  sizes  and  destractive  testing  that 
did  not  use  Bayesian  or  sequential  sampling.  These  two  methods  are  given  since  they  contain  some 
ideas  that  may  be  utilized  in  the  surveillance  plan  for  the  chemical  defense  suits. 

2.10.1  Chain  Sampling. 

In  Baker  and  Thomas’s  paper,  chain  sampling  is  used  to  develop  an  attribute  acceptance 
sampling  plan  for  expensive  armor  packages  with  destmctive  testing  (Baker  and  Thomas, 
1992:213-223).  Chain  sampling  utilizes  information  over  a  series  of  lots.  Instead  of  a  single 
sampling  plan  with  a  sample  of  size  n,  where  in  this  case,  one  failure  results  in  the  rejection  of  the 
lot,  chain  sampling  utilizes  information  from  the  prior  J  lots.  This  method  protects  a  lot  from 
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rejection  if  a  failure  occurs  in  its  sample,  provided  no  failures  have  occurred  in  the  immediately 
preceding;  lots  (Juran,  1988:25.37).  In  other  words,  if  the  prior;  lots  have  no  failures  in  then- 
samples  and  a  failure  occurs  in  the  present  lot,  it  is  not  rejected  as  it  would  be  under  single 
sampling.  It  is  particularly  useful  for  small  samples  and  protects  lots  from  being  rejected  as  a 
result  of  an  anomaly  of  one  bad  item  in  a  sample. 

2.10.2  Methodology  Using  a  Deteriorating  Reliability  Function. 

Bain  and  Engelhardt  developed  an  attribute  sampling  plan  for  one-shot  devices  with  assumed 
deteriorating  reliability  (Bain  and  Engelhardt,  1991:304-31 1).  The  methodology  does  not  use  prior 
information,  but  does  rely  on  the  result  of  earlier  samples  as  the  sampling  continues  through  time. 
The  methodology  also  assumes  a  function  q(t)  which  is  the  probability  that  a  device  fails  to  work  at 
limp,  t,  and  samples  are  taken  on  a  regular  time  interval,  such  as  annually.  Trials  of  items  in  the 
sample  are  pass  /  fail,  and  the  methodology  uses  a  binomial  sampling  distribution. 

The  methodology  assumes  a  WeibuU  degradation  function  q(t)  where  the  shape  parameter  is 
assumed  known,  but  the  scale  parameter  is  unknown.  At  time  ti ,  the  first  sample  is  drawn.  Since 
no  prior  information  is  used,  classical  statistics  are  used  for  choosing  the  first  sample  size.  It 
assumes  no  failures  will  be  observed,  so  a  sample  size  is  found  solving  bfO;  ni  ,q*)=  a,  where 
b(x;  n  ,q)  is  the  binomial  pdf  with  parameters  n  and  q,  and  a  is  the  producer’s  risk,  q*  is  the 
maximum  allowed  probability  for  failure  of  items  in  the  lot. 

The  next  period’s  sample  size  is  found  solving  for  «2  the  following  equation: 

8(0;  ni ,  q(ti))B(0;  n2  ,q*)=  a  (42) 

and  solving  for  n^  in  general  for  k  independent  samples  from  k  periods 
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(43) 


k-1 

B(0;  Hk ,  q*)  n  B(0;  njq(  tj  ))  =  a 
i=l 

which  is  a  recursive  formula,  resulting  in  less  items  sampled  as  the  number  of  periods  sampled 
increases. 

The  parameters  of  q{t)  are  updated  at  each  kth  stage  of  testing,  so  that  q(tk)  =  q*.  The  key 
factor  in  this  methodology  is  finding  a  function  that  models  the  degradation  of  the  reliability  of  the 
items  being  sampled. 

2.11  Summary 

In  this  literature  review  we  gave  a  brief  introduction  to  sampling  and  a  review  of  Bayesian 
statistics  for  those  not  famUiar  with  these  two  subjects.  We  then  gave  current  methods  for 
selecting  a  prior  distribution  for  binomial  sampling.  Two  Bayesian  methodologies  for  sampling 
from  a  population  with  deteriorating  reliabihty  and  destructive  testing  that  utilized  attribute  data 
were  given.  This  was  followed  by  an  introduction  to  sequential  sampling  and  a  method  for 
sequential  sampling  using  variable  data  assuming  a  normally  distributed  population.  The  literature 
review  concluded  with  a  couple  of  methodologies  used  to  sample  from  populations  where 
destructive  testing  was  the  only  method  for  sampling. 
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3.  Methodology 


3.1  Introduction 

This  section  introduces  the  methodology  used  in  developing  a  sampling  plan  for  the  chemical 
defense  coveralls.  The  first  part  of  this  section  examines  the  assumptions  made  and  current 
concepts  held  by  the  Human  Systems  Program  Office  at  Brooks  AFB.  The  second  portion  of  this 
section  is  aimed  at  analyzing  and  evaluating  the  Air  Force’s  current  sampling  plan  for  the  chemical 
defense  suits.  Once  a  thorough  understanding  is  gained  from  the  current  sampling  plan, 
methodologies  are  developed  to  improve  the  current  sampling  plan  and  presented  in  the  third  part 
of  this  section.  The  last  part  of  the  methodology  gives  a  description  of  the  simulation  methods  used 
to  compare  the  various  methodologies  introduced  in  this  section. 

3.2  Assumptions 

The  program  managers  at  the  Human  Systems  Program  Office  gave  the  requirement  for  a 
95%  confidence  level  for  sampling.  That  is,  their  a  value  is  .05.  The  program  managers  also 
indicated  that  they  do  not  assume  there  will  be  any  noticeable  difference  between  lots  and  they 
assume  the  degradation  of  suits  in  different  lots  wfil  occur  at  the  same  rate.  Their  current  sampling 
plan  is  not  designed  to  detect  a  degradation  in  performance  of  a  single  lot,  but  rather,  it  is  designed 
to  detect  the  degradation  in  the  suits’  protective  capability  as  a  whole  population. 

The  Human  Systems  Program  Office  and  Natick  agreed  to  run  a  series  of  eight  tests 
measuring  eight  different  characteristics  on  each  suit.  These  tests  are  given  in  Table  2.  In  our 
methodology,  we  did  not  include  the  Laundering  test  and  the  Packaging  test,  since  these  are  simply 
observational  pass/fail  tests  and  the  test  managers  did  not  feel  they  would  be  a  factor  in 
determining  whether  suits  were  passed  or  failed..  This  left  us  with  six  tests.  We  further  divided 
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three  of  the  tests,  Chemical  Adsorption,  Spray  Rating,  and  Dynamic  Adsorption,  into  two  tests 
each  since  they  involve  laundered/unlaundered  tests.  Two  of  the  tests.  Breaking  Strength  and  Tear 
Strength,  are  also  divided  into  two  tests  each,  since  each  test  actually  involves  two  tests,  a  warp 
test  and  a  fill  test.  This  further  dividing  of  tests  leaves  us  with  a  total  of  1 1  tests. 


Table  2.  Surveillance  Tests  for  Chemical  Defense  Coveralls 


Characteristic 

Requirement 

Description 

Packaging 

N/A 

Result  of  test  is  pass  /  failure 

Chemical 

Adsorption 

To  be  determined 

Most  important  characteristic  of  suit.  2/3  of  samples  will  be  laundered, 

1/3  not.  3  swatches  sampled  per  coverall.  Score  for  test  is  average  of 

3  swatches. 

Breaking  Strength 

Minimum  Values: 

Warp:  160  lbs 

Filling:  100  lbs 

Test  finds  value  at  which  material  breaks.  Warp  value  is  strength  of 
material  running  parallel  with  fiber,  Filling  Is  strength  of  material  running 
perpendicular  to  fiber.  10  swatches  sampled  per  coverall  (5  per  test). 
Score  is  average  of  5  swatches. 

Tearing  Strength 

Minimum  Values: 

Warp:  6  lbs  Filling: 

4  lbs 

Test  finds  value  at  which  material  tears.  2  swatches  sampled  per 
coverall,  1  for  warp  test  and  1  for  filling  test. 

Seam  Strength 

Minimum  Value:  70  lbs 

Tests  strength  of  seams  to  breakage.  1  swatch  per  coverall. 

Laundering 

Durability 

N/A 

Result  of  test  is  pass  /  failure.  Tests  how  well  material  stands  up  to 
laundering. 

Water  Resistance  - 
Spray  Rating 

Minimum  values  to  be 
determined 

Measures  time  it  takes  water  to  soak  through  material.  2/3  of  suits 
tested  are  laundered,  other  1/3  are  not.  3  swatches  per  coverall.  Score 
is  average  of  3  swatches. 

Water  Resistance  - 
Dynamic 

Adsorption 

Minimum  Values  to  be 
determined 

Measures  amount  of  water  material  soaks  up  when  immersed  in  water. 
2/3  of  suits  tested  are  laundered,  other  1/3  are  not.  2  swatches  per 
coverall.  Score  is  average  of  2  swatches. 

With  these  tests  we  made  the  assumption  that  the  test  results  are  normally  distributed.  We 
feel  this  is  an  adequate  approximation.  In  the  past,  the  army  has  found  that  the  results  of  their  tests 
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on  chemical  suits  usually  center  around  a  central  mean  with  nearly  equal  amount  of  results  falling 
above  and  below  the  mean.  The  army  also  ran  a  goodness-of-fit  test  on  their  chemical  adsorption 
test  data  and  the  results  indicated  that  a  normal  distribution  was  appropriate  for  modeling  the  data 
(Army  Materiel  Systems  Analysis  Activity  memorandum,  1987).  Assuming  a  normal  distribution 
will  allow  us  to  use  statistical  methods  that  assume  a  normal  distribution. 

While  each  of  the  tests  run  on  the  suits  have  to  meet  minimum  requirements,  the  program 
managers  have  not  yet  set  statistical  measures  or  criteria  as  when  to  say  that  a  population  of 
chemical  suits  is  no  longer  effective.  This  is  a  judgment  call  made  by  the  decision  maker,  that 
takes  into  consideration  costs,  age  of  suits,  ability  to  replace  existing  suits,  and  which  tests  the 
suits  have  failed  among  other  factors.  For  example,  if  the  suits  fail  to  meet  the  minimum 
requirement  for  the  Chemical  Adsorption  test,  this  would  tend  to  be  more  serious  than  if  the  suits 
failed  to  pass  the  Breaking  Strength  test.  While  we  feel  that  the  Chemical  Adsorption  test  is  the 
most  critical  test,  the  program  managers  at  Aeronautical  Systems  Center  were  reluctant  to  commit 
to  any  type  of  weighting  of  the  tests  in  terms  of  the  tests  relative  importance  in  rejecting  or 
accepting  the  suits.  We  will  now  take  a  look  at  the  sampling  plan  proposed  by  the  Human  Systems 
Program  Office. 

3.3  Original  Sampling  Plan 

As  mentioned  earlier,  the  Air  Force  has  no  statistical  tests  to  help  make  a  decision  with  its 
sampling  plan,  but  instead  makes  a  judgment  call  as  to  when  a  population  of  suits  has  degraded 
enough  to  reject.  In  order  to  compare  their  current  sampling  plan  to  proposed  sampling  plans,  it  is 
necessary  to  model  the  current  sampling  plan  as  a  statistical  test  in  which  the  measured 
characteristic  of  the  suit  is  either  rejected  or  accepted  based  upon  the  test  results.  Note  that  we  are 
modeling  statistical  decisions  to  accept  or  reject  based  upon  sampling  results  and  comparing  this 
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information  to  other  sampling  plans.  We  are  not  attempting  to  model  the  decision  maker's  decision 
process. 

The  first  thing  we  did  was  to  gather  data  from  the  Army’s  past  tests  on  its  chemical  protective 
clothing.  While  it  was  pointed  out  by  the  testing  analysts  at  Natick  that  the  Army’s  suits  and  the 
Air  Force’s  suits  were  made  out  of  different  materials,  we  felt  the  Army’s  data  would  be  a  good 
start.  We  used  the  Army’s  mean  values  and  standard  deviations  of  the  test  results.  For  the  Seam 
Strength  test,  a  test  that  the  Army  did  not  use  on  its  suits,  we  set  a  hypothetical  mean  of  100  for  the 
test  and  a  standard  deviation  of  10.  The  minimum  requirement  for  this  test  was  set  at  a  value  of 
70.  We  picked  these  values  since  they  are  reasonable  values  as  far  as  the  standard  deviation  being 
10%  of  the  mean  and  the  minimum  requirement  for  the  test  being  70%  of  the  current  mean  value. 
These  values  are  similar  to  what  we  have  seen  with  the  other  tests. 

We  set  up  statistical  tests  for  the  1 1  tests  that  would  be  run  on  the  samples.  We  will  run  each 
of  the  eleven  tests  independent  of  each  oflier.  The  hypothesis  tests  we  used  are  defined  as  the 
following: 

Ho:  The  mean  value  of  a  population’s  test  parameter  >  minimum  requirement.  (Meets  the 

requirement). 

H,:  The  mean  value  of  a  population’s  test  parameter  <  the  minimum  requirement.  (Does 

not  meet  the  requirement). 

Note  that  in  the  case  of  Dynamic  Adsorption,  the  actual  requirement  is  a  maximum  requirement, 
while  all  of  the  other  tests  have  minimum  requirements,  so  the  appropriate  changes  in  the  tests  are 
made. 

For  the  tests,  the  Human  Systems  Program  Office  set  a  =  .05  with  no  mention  of  any  value 
for  P .  We  decided  to  set  p  =  .10,  which  is  a  reasonable  choice.  Recall  that  a  Type  11  error  is  the 
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probability  that  a  population  is  accepted,  when  in  fact,  the  population  should  have  been  rejected, 
pis  the  probability  that  a  Type  II  error  will  occur.  The  actual  statistical  test  is  a  test  that  is  found 
in  most  statistics  books  and  is  now  presented.  (Mendenhall,  Wackerly,  and  Scheaffer,  1990: 434- 
455). 

With  a  large  sample  size,  the  mean  value  of  the  results  from  the  sampled  tests  will  approach  a 
noraial  distribution  regardless  of  the  distribution  from  which  the  sample  is  drawn.  Some  texts 
(Mendenhall,  Wackerly,  and  Scheaffer,  1990:  319)  state  that  30  samples  suffice  as  a  large  sample 
size,  while  most  other  texts  say  that  it  takes  100  samples  to  be  regarded  as  a  large  enough  sample. 
This  property  is  from  the  widely  known  Central  Limit  Theorem.  We  will  assume  the  results  of  the 
eleven  tests  can  be  modeled  as  normal  distributions.  Let  U  be  the  actual  population  average  and 
Ua  be  the  minimum  requirement  for  the  results  of  a  specific  test.  We  run  the  following  test: 

Ho:  U=Uo  (Meets  requirement). 

Ha:  U<Uo  (Does  not  meet  requirement). 

Y _ ^ 

Test  statistic:  T= - ^  (44) 

SI  4n 

where  S  is  the  sample  standard  deviation  or  known  standard  deviation  and  n  is  the  number  of 
samples.  F  is  the  average  of  the  test  results  that  have  been  sampled.  The  null  hypothesis  is  then 
rejected  if: 

T<-ta, 

where  the  t-value  has  n-1  degrees  of  freedom.  This  statistical  test  was  set  up  for  aU  eleven  tests  of 
the  samples  with  a=  .05.  Solving  the  above  inequality  for  Y  ,  the  appropriate  rejection  regions 
(rr)  are  given  in  Table  3  for  the  values  of  f .  If  the  value  of  f  falls  below  the  reject  region,  the 
null  hypothesis  Ho  is  rejected. 
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Table  3.  Rejection  Regions,  Condition  I 


TEST 

MEAN 

STD 

DEV 

MIN 

REQMNT 

NUMBER 

OF 

SAMPLE 

Reject 

Value 

Break 

Strength  (W) 

246 

27.22 

190 

30 

181.8 

Break 

Strength  (F) 

159.17 

9.73 

115 

30 

112.1 

Tear 

Strength  (W) 

10.5 

1.342 

10 

30 

9.597 

Tear 

Strength  (F) 

■m 

1.342 

7 

30 

6.597 

Seam 

Strength 

100 

10 

70 

30 

67 

Spray 

Rating 

(Laundered) 

(Unlaundered) 

98.17 

8.68 

90 

20 

10 

86.64 

84.97 

Dynamic 

Adsorption 

(Laundered) 

13.7 

9.55 

20  (MAX) 

20 

23.69 

(Unlaundered) 

10 

25.54 

Chemical 

Adsorption 

(Laundered) 

2.32 

0.29 

1.3 

20 

1.188 

(Unlaundered) 

10 

1.132 

While  these  tests  give  Type  1  errors  of  .05,  nothing  is  mentioned  of  Type  II  errors.  Recall  we 
wish  to  set  P=  .10.  With  Type  II  errors,  it  is  only  possible  to  calculate  the  actual  P  value  at  a 
fixed  point.  That  is,  at  some  fixed  point  below  the  minimum  requirement  of  the  tests,  we  want  only 
a  .10  probability  of  accepting  a  degraded  population  of  suits.  The  fixed  point  Ui  was  established 
for  each  test  by  simply  taking  95%  of  the  minimum  requirement  of  the  test.  We  are  thus  saying. 
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we  would  like  only  a .  10  probability  of  accepting  a  suit  when  its  test  value  has  degraded  to  95%  of 
the  minimum  requirement. 

In  addition  to  these  choices  for  C/o  and  Ui ,  which  we  will  call  Condition  1,  we  did  a  separate 
analysis  where  the  starting  means  of  the  tests  were  chosen  as  t/o  and  the  minimum  requirements 
for  the  tests  were  chosen  as  the  values  for  U i .  We  wiU  call  these  values  Condition  11.  The 
Condition  II  values  give  a  higher  value  for  the  rejection  region.  These  new  values  state  that  when 
we  are  at  the  starting  mean  value  for  one  of  the  tested  criteria  of  the  suits,  we  want  to  accept  the 
suits  95%  of  the  time,  but  when  the  suits  degrade  to  the  minimum  requirement,  we  want  to  reject 
the  suits  90%  of  the  time.  Obviously,  this  puts  a  tighter  requirement  on  the  suits  and  we  would 
expect  to  reject  the  suits  earlier  in  their  lifetime. 

The  actual  value  of  p  can  be  calculated  by  finding  the  probability  that  the  f  >  rejection 
region  when  the  true  mean  of  the  tests  U=Ui.  This  is  accomplished  by  finding  the  probability 
rr 

that  t„  ^ - 7^  where  rr  is  the  rejection  region.  This  is  found  by  finding  this  value  on  an 

sl4n 

inverse  T-table,  where  has  n-1  degrees  of  fteedom. 

The  most  widely  used  way  to  obtain  the  needed  P  value  if  the  actual  P  value  is  greater  than 
desired  is  to  increase  the  number  of  samples.  Increasing  the  number  of  samples  will  always  reduce 
the  a  and  P  risk.  But  since  the  samples  are  destractive  and  expensive,  alternative  ways  to  bring 
down  the  Type  II  error  without  drastically  increasing  the  sample  sizes  are  desired.  We  will  now 
propose  various  sampling  methods  that  will  reduce  the  number  of  samples  while  maintaining  the 
needed  a  and  p  requirements. 
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3.4  Pre-posturing  Sampling 

This  method  of  sampling  is  very  similar  to  the  original  method  of  sampling.  The  difference  is 
instead  of  sampling  30  suits  each  year,  fewer  suits  are  sampled  in  the  earlier  years  of  the  sampling 
plan  and  more  suits  sampled  in  the  later  years.  Those  suits  that  have  been  saved  from  sampling 
fewer  suits  in  the  earlier  years  are  then  used  for  sampling  in  the  later  years  of  the  tests,  so  that 
more  suits  are  tested  as  the  suits  are  getting  older.  Over  the  period  of  16  years  of  sampling,  the 
total  amount  of  suits  sampled  is  the  same  as  in  the  original  sampling  plan. 

The  idea  behind  testing  fewer  suits  in  the  earlier  years  and  more  suits  in  the  later  years  is  that 
in  the  earlier  years,  the  actual  means  of  the  suits'  characteristic  are  probably  far  above  that  of  the 
minimum  requirement.  Testing  30  suits  in  the  first  few  years  would  be  far  in  excess  of  the  number 
of  suits  needed  to  meet  the  a  and  P  probability  demands.  On  the  other  hand,  as  the  actual  means 
of  the  suits’  characteristics  approach  the  minimum  requirements,  30  suits  is  not  enough  to  give  a 
very  smaU  p  value.  By  increasing  the  amount  of  suits  tested  at  this  time,  the  actual  p  value  will 
decrease.  Table  4  lists  the  number  of  suits  to  be  tested  each  year  with  this  methodology. 


Table  4.  Pre-Postuiing  Sample  Plan 


1  YEAR  1 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

SUITS 

SAMPLED 

12 

12 

21 

21 

30 

30 

39 

39 

48 

48 

30 

30 

The  designed  life  of  the  chemical  suits  is  ten  years.  Since  the  test  managers  are  actually 
expecting  a  longer  lifetime  for  the  suits,  the  number  of  tests  to  be  sampled  increases  after  year  ten. 
We  assume  we  still  have  a  finite  value  of  360  suits  to  be  used  as  samples.  The  number  of  samples 
decrease  in  year  15  and  16  since  the  probability  of  the  suits  surviving  this  long  may  be  a  little 
optimistic.  Potential  test  infomaation  would  be  wasted  if  we  postured  most  of  the  suits  to  be 
sampled  in  years  15  and  16  and  the  suits  were  rejected  in  year  12.  The  success  of  this  sampling 
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method  relies  on  making  an  accurate  prediction  as  to  what  year  most  of  the  suits’  tested 
characteristics  degrade  to  values  exactly  between  Uo  and  Ui ,  the  values  in  which  the  most  suits 
will  be  needed  for  testing. 

3.5  Sequential  Sampling 

Sequential  sampling  allows  the  person  running  the  tests  to  stop  taking  samples  when  a 
decision  has  been  statistically  made  as  to  whether  to  accept  or  reject  the  testing  of  suits.  Again,  we 
are  assuming  the  suits’  test  results  are  nonnally  distributed.  The  sequential  sampling  for  a  normal 
population  was  presented  in  Chapter  II.  We  will  again  assume  that  a  =.05  and  p  =.10  as  we  have 
in  the  two  previous  methodologies. 

The  sequential  sampling  concept  is  slightly  similar  to  the  pre-posturing  method.  When  the 
actual  means  of  the  characteristics  of  the  suits  are  far  above  the  minimum,  sequential  sampling  will 
allow  for  very  few  samples  to  be  drawn.  When  the  actual  means  are  close  to  the  minimum 
requirements,  more  samples  are  needed  to  ensure  that  a  and  P  are  less  than  or  equal  to  .05  and .  10 
respectively.  Each  of  the  eleven  tests  are  still  treated  independently  of  each  other.  A  separate 
sequential  sampling  plan  is  used  on  each  test.  Figure  3  gives  a  graphical  presentation  of  sequential 
sampling. 

In  Figure  3,  the  horizontal  axis  represents  the  number  of  samples  taken  and  the  vertical  axis 
represents  the  sum  of  the  results  of  the  samples.  The  sum  of  the  test  results  are  plotted  versus  the 
number  of  samples.  If  the  plots  go  into  the  Reject  Region,  the  population  is  rejected  and  sampling 
ceases.  If  the  plots  go  into  the  Accept  Region,  as  indicated  in  this  figure,  the  population  is 
accepted  and  sampling  ceases.  As  long  as  the  plots  remain  between  the  Accept  and  Reject  regions, 
sampling  will  continue.  For  our  tests,  we  will  assume  that  we  can  sample  an  infinite  number  of 
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suits,  so  we  will  run  all  sequential  tests  until  we  have  made  either  an  accept  or  reject  decision  for 
that  particular  characteristic. 


3.6  Aggregated  Sequential  Sampling 

One  downfall  of  sequential  sampling  is  that  with  eleven  different  tested  characteristics,  one  of 
the  tested  characteristics  may  drive  up  the  number  of  suits  to  be  sampled  while  all  other 
characteristics  may  be  finished  sampling  after  four  or  five  suits.  Since  a  whole  suit  must  be 
destroyed  for  even  one  test,  unbounded  sequential  sampling  may  not  save  that  many  suits  in  the 
long  run.  This  problem  is  what  drives  this  method  of  Aggregated  Sequential  Sampling. 

This  method  of  sequential  sampling  weights  the  values  of  the  eleven  tests  so  that  the  weights 
sum  to  one.  The  results  of  the  eleven  tests  are  then  combined  to  form  one  test  result.  In  this  way, 
the  decision  maker  can  deem  which  measured  characteristics  are  most  important  and  assign  these 
tested  values  the  most  weight. 
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The  first  step  in  this  methodology  is  to  standardize  the  tests'  results.  This  is  done  by 
subtracting  the  minimum  requirement  fiom  the  actual  test  result,  and  then  dividing  this  difference 
by  the  standard  deviation  for  that  particular  test.  By  doing  this  we  then  have  1 1  normal 
distributions  with  standard  deviations  of  one. 

Each  of  these  test  results  is  then  weighted  so  that  the  eleven  weights  sum  to  one.  These  eleven 
weighted  standardized  test  results  are  then  added  together  to  form  a  single  value.  By  adding  these 
eleven  weighted  normal  results  together,  the  resulting  distribution  of  these  sums  is  still  a  normal 
distribution.  The  variance  of  this  summed  normal  distribution  can  be  found  by  taking  the  sums  of 
the  squares  of  the  eleven  weights  (Law  and  Kelton,  1994:336). 

To  give  some  insight  as  to  what  the  standardizing  and  weighting  of  the  tests  wUl  do,  we 
present  an  example.  If  all  eleven  test  results  were  exactly  at  their  minimum  requirement,  each 
individual  standardized  score  would  be  zero.  The  sum  of  the  weighted  standardized  values  would 
also  be  zero,  regardless  of  the  choices  for  the  weights.  Now  if  ten  of  the  eleven  results  were 
exactly  at  their  minimum  requirement  but  the  one  other  result  was  slightly  below  its  minimum 
requirement,  then  the  sum  of  these  weighted  tests  would  be  slightly  less  than  zero.  If  the  same  were 
tme  that  ten  of  the  eleven  test  results  were  at  the  minimum  requirement  but  the  one  other  result  was 
slightly  above  the  minimum  requirement,  then  the  sum  of  the  values  would  be  slightly  greater  than 
zero.  Therefore,  if  the  sum  of  the  weighted  standardized  results  is  greater  than  zero,  we  would 
want  to  accept  the  suit,  and  if  the  sum  of  the  weighted  standardized  tests  is  less  than  zero,  we 
would  want  to  reject  the  suit.  Our  Uo  value  for  the  aggregated  test  is  then  0.  That  is,  if  the  trae 
mean  of  the  sum  of  the  weighted  standardized  test  results  is  truly  0,  we  would  want  to  accept  the 
suits  95%  of  the  time. 

Intuitively,  if  a  heavily-weighted  test  result  was  below  the  minimum  requirement,  it  would 
take  many  other  less-weighted  test  results  above  their  minimum  requirement  in  order  to  pass  a  suit. 
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In  the  same  manner,  if  a  less-weighted  test  result  was  below  the  minimum  requirement,  it  would  not 
have  much  effect  in  rejecting  a  suit  due  to  its  low  weight. 

The  value  for  computing  the  type  II  error  p  is  found  by  taking  the  value  Ux  from  each  of  the 
1 1  tests  (before  standardizing)  and  finding  its  standardized  value  by  standardizing  this  Ux  value 
using  the  method  given  above.  For  example,  the  suit  criteria  Seam  Strength  has  a  minimum 
requirement  of  70,  which  is  its  Uo  value,  and  95%  of  70  is  66.5,  which  is  the  value  for  Ux .  Its 
standard  deviation  is  10.  Its  standardized  Ux  value  can  be  computed  as  (66.5-70)/10  =  -.35.  See 
Figures  4  and  5  for  a  visual  representation  of  the  mean  and  C/o  and  Ux  values  for  the  Seam  Strength 
test  before  and  after  the  values  were  standardized. 


Figure  4.  Seam  Strength  Values  Before  Standardizing 
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These  eleven  standardized  Ui  values  are  then  multiplied  by  the  respective  weights  given  to 
the  tests  and  summed.  This  summed  value  is  then  the  U i  value  for  computing  the  p  error  for  the 
aggregated  tests.  That  is,  if  all  suits  had  degraded  to  95%  of  their  minimum  requirement,  we 
would  want  to  accept  the  suits  10%  of  the  time  with  p=  .10.  Note  that  when  the  results  are 
standardized  in  this  fashion,  Uo ,  the  minimum  requirement  for  this  Aggregated  Sequential 
sampling  technique  will  always  be  zero,  and  U\  will  be  a  negative  value. 

One  note  for  the  Dynamic  Absorption  test.  Since  this  test  has  a  maximum  number  as  its  test 
requirement,  the  value  for  its  standardized  normal  is  easily  calculated  by  subtracting  the  actual  test 
value  from  the  maximum  requirement  rather  than  subtracting  the  requirement  from  the  actual  test 
value.  The  difference  is  still  divided  by  the  standard  deviation  and  the  result  is  then  weighted  and 
added  with  the  rest  of  the  tests  in  the  given  manner. 

For  our  simnlarinn,  we  weighted  the  laundered  and  unlaundered  Chemical  Adsorption  test  at 
.25  each  and  the  other  nine  tests  were  weighted  at  .0556  each.  This  is  because  we  feel  Chemical 
Adsorption  is  the  most  important  characteristic  of  the  suits.  These  eleven  test  weights  sum 
approximately  to  one.  Using  the  above  method  for  determining  the  aggregated  Ux  value,  we  get  U\ 
-  -.2882.  Since  P=.10,  we  want  only  a  .10  probability  of  accepting  the  suits  when  sum  of  the 
standardized  test  results  is  -.2882.  Obviously,  the  most  heavily  weighted  tests  have  greater 
influence  in  whether  a  suit  passes  or  fails.  The  sum  of  these  eleven  standardized  tests  have  a 
normal  distribution  that  has  a  standard  deviation  of  .391,  which  is  found  by  summing  the  squares 
of  the  eleven  weights  and  taking  the  square  root  of  the  sum.  See  Figure  6  for  a  graph  of  the 
Aggregated  test  result  values. 
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Figure  6.  Aggregated  Sequential  Values 

It  is  important  to  point  out  that  this  is  the  only  methodology  presented  in  this  thesis  that 
rejects  or  accepts  the  suits  based  upon  the  performance  of  aU  tested  characteristics.  In  a  sense,  we 
are  modeling  the  decision  makers  choice  on  what  weights  to  give  each  test.  In  the  other 
methodologies,  we  just  find  whether  a  suit’s  characteristic  would  be  rejected  or  accepted  based 
upon  its  respective  test.  In  these  other  methodologies,  the  decision  maker,  taking  into  account  the 
information  fiom  the  different  tests,  would  decide  whether  or  not  to  reject  the  suit. 

3.7  Truncated  Sequential  Sampling 

Truncated  Sequential  Sampling  is  the  same  as  the  standard  Sequential  Sampling  with  the 
exception  that  a  finite  limit  of  suits  is  pre-determined  before  sampling  is  started.  If  an  accept  or 
reject  decision  is  not  attained  by  the  time  the  suit  limit  is  reached,  a  decision  is  made  at  this  time  to 
either  accept  or  reject,  even  though  the  test  may  say  to  continue  sampling.  There  are  various 
methods  to  detennine  whether  to  accept  or  reject  once  the  test  limit  has  been  reached.  We  simply 
divide  the  accept  and  reject  region  in  half  and  whatever  side  the  final  test  value  is  closer  to, 
determines  the  decision  to  accept  or  reject  the  suits.  See  Figure  7  for  a  graphical  representation  of 
the  truncated  sequential  plan. 
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Number  of  Samples 
Figure  7.  Truncated  Sequential  Plan 

Note  that  in  Figure  7,  that  if  by  32  samples  the  plot  of  the  sum  of  the  test  results  versus  the 
number  of  samples  has  not  crossed  into  either  the  Reject  or  Accept  Region,  the  population  will  be 
sentenced  as  determined  by  which  side  of  the  center  line  the  plot  falls  on  when  32  samples  have 
been  taken. 

Our  methodology  will  be  as  such.  We  start  off  in  the  first  year  of  testing  with  360  suits 
available  at  our  disposal  for  testing.  We  will  assume  the  suits  wiU  remain  in  service  for  the 
planned  16  years,  thus  we  will  sample  suits  each  year  from  year  5  to  16,  for  a  total  of  twelve  years. 
This  gives  an  average  of  30  suits  per  year.  So  for  the  first  year  of  testing,  we  will  truncate  testing 
at  30  suits.  Our  hope  is  that  somewhat  less  than  30  suits  will  be  sampled  in  the  first  year. 

For  example,  say  only  8  suits  are  tested  in  the  first  year.  This  then  leaves  us  with  352  suits  to 
be  tested  over  the  remaining  eleven  years.  352  divided  by  1 1  now  gives  us  32  suits  to  be  tested 
each  year.  This  is  how  we  will  determine  the  maximum  number  of  suits  to  be  tested  each  year. 

In  running  the  sequential  sampling  for  each  of  the  eleven  different  tests,  we  will  quit  testing 
when  either:  1)  we  reach  our  maximum  suit  limit  for  the  year,  or  2)  when  of  all  the  tests  have 
made  a  decision  to  either  accept  or  reject  before  the  maximum  suit  limit  has  been  reached, 
whichever  comes  first.  Even  though  a  tested  characteristic  may  have  made  an  accept  or  reject 
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decision  after  only  a  few  suits  were  sampled,  we  will  still  keep  running  the  same  tests  as  long  as 
other  tested  characteristics  still  need  samples  to  be  tested  in  order  to  make  a  decision.  This  will 
allow  more  information  to  be  gained  on  the  actual  values  of  the  tested  characteristics  and  every  test 
will  now  have  the  same  number  of  samples  tested. 

For  our  simulation  of  this  methodology,  we  did  not  allow  Dynamic  Adsorption  to  determine 
if  sampling  should  be  continued.  Tlie  reason  for  this  is  its  variance  is  so  large,  that  it  always  takes 
more  than  30  samples  to  determine  whether  to  accept  or  reject  at  our  given  a  and  p  values.  It  may 
be  possible  to  determine  that  the  variance  of  the  Dynamic  Adsorption  test  is  smaller  than  has  been 
estimated  by  computing  the  sample  variance  after  the  baseline  tests  are  run.  For  now  however,  we 
will  assume  that  the  variance  will  remain  the  same  as  the  value  we  have  assigned.  If  Dynamic 
Adsorption  were  included  in  the  determination  when  to  quit  sampling,  we  would  automatically  be 
sampling  30  suits  every  year  with  no  savings  in  suits  in  the  earlier  years. 

3.8  Bayesian  Sampling 

This  method  uses  the  methodology  of  Launer  and  SingpurwaUa’s  “A  Bayesian,  Sample 
Theoretic  Approach”  presented  in  the  literature  review,  section  2.8.2.  One  small  change  is  made 
to  their  approach.  In  their  approach,  they  defined  A  to  be  a  fixed  constant  so  that  the  test  always 
tries  to  detect  a  degradation  of  A  in  the  reliability.  In  our  method,  A  is  always  updated  so  that  the 
test  can  detect  a  degradation  in  the  reliability  of  the  suits  below  a  given  value,  this  value  being  the 
minimum  requirement  of  the  tests.  A  is  then  decreasing  as  the  reliability  of  the  suits  degrades.  A 
is  then  the  difference  between  the  current  estimated  mean  of  the  reliability  and  the  minimum 
accepted  value  of  reliability. 

In  this  Bayesian  Sampling  method,  the  two  parameters  for  the  Beta  distribution  (we  will  call 
A  and  B)  and  A  are  chosen  by  the  decision  maker,  a  and  P  are  defined  as  before  at  .05  and  .10 
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respectively.  The  Bayesian  algorithm,  attempts  to  find  the  minimum  number  of  tests  needed  in 
order  to  detect  a  degradation  of  A  in  the  reliability  of  the  suits  while  keeping  a  and  p  at  their 
required  values.  The  Beta  distribution  (Beta(A,B))  is  the  probability  distribution  of  the  parameter 
p, ,  probability  of  a  successful  trial  at  time  t,  in  the  Binomial  distribution.  Recall  that  this  Bayesian 
sampling  plan  and  the  Binomial  distribution  uses  pass/fail  criteria  for  their  tests.  Since  this 
sampling  plan  uses  attribute  sampling,  we  need  to  convert  the  variable  sampling  results  to  attribute 
results.  This  can  simply  be  done  by  regarding  a  test  above  the  minimum  requirement  as  a  pass  and 
a  test  below  the  minimum  requirement  as  a  fail. 

The  mean  of  the  Beta(A,B)  distribution  is  AI(A+B).  We  can  determine  a  beta  prior 
distribution  for  the  criteria  being  tested  by  finding  the  probability  that  the  tested  criteria  will  be 
above  the  minimum  requirement  for  that  particular  test.  We  can  do  this  given  we  know  the  tests’ 
starting  means,  standard  deviations,  and  minimum  requirement  values,  again  assuming  a  normal 
distribution. 

For  example,  the  Spray  Rating  test  starts  with  a  mean  of  98.17,  a  standard  deviation  of  8.68, 
and  a  minirmim  requirement  of  90.  We  want  to  know  what  is  the  probability  we  will  get  a  test 
result  of  90  or  higher  given  we  have  a  N(98. 17,8.68)  distribution.  This  is  done  by  converting  this 
Normal  distribution  to  the  Standard  Normal  distribution  and  looking  up  the  value  in  a  standard 
normal  probability  table.  In  this  case,  (90-98.17)78.68  =  .941.  Looking  this  value  up  in  a 
standard  normal  table,  we  find  there  is  a  .826  probability  that  the  test  will  be  above  the  minimum 
requirement.  Figure  8  gives  a  visual  representation  of  finding  the  probability  that  a  Spray  Rating 
result  wUl  be  above  the  minimum  requirement  (90)  when  its  tme  mean  is  98. 17.  Values  greater 
than  90  are  considered  a  pass,  values  less  than  90  are  considered  a  faU.  The  area  under  the  curve 
to  the  right  of  the  minimum  requirement  is  the  probability  that  a  result  of  the  Spray  Rating  test  will 
pass.  This  area  is  .826. 
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Figure  8.  Spray  Rating  Probability  of  Passing  Minimum  Requirement 

We  now  set  AI(A-i-B)  equal  to  this  probability  of  a  test  being  above  the  minimum  requirement 
and  also  setting  A+B  equal  to  some  value,  say  10.  In  this  way,  the  Beta(/4,jB)  distribution  starts  out 
with  a  probability  equal  to  the  probability  of  a  single  test  being  greater  than  the  minimum  value. 

We  equate  A-\-B  =10  since  this  seems  to  be  a  reasonable  value.  As  the  sum  of  A+6  gets  larger, 
the  variance  of  the  Beta  distribution  gets  tighter  and  the  prior  distribution  is  given  more  weight  in 
effecting  the  outcome  of  the  accept  or  reject  decision.  On  the  other  hand,  as  A+B  gets  smaller,  the 
variance  for  the  Beta  distribution  becomes  very  wide  and  has  little  to  no  influence  on  the  sampling. 
Further  study  will  be  needed  in  selecting  the  proper  value  for  A+B. 

The  value  for  A  is  determined  by  the  difference  in  the  mean  of  the  Beta(A,B)  distribution  and 
0.5.  This  is  because  when  the  actual  mean  of  a  suit’s  measured  test  value  is  equal  to  the  minimum 
required  value  for  that  test,  it  has  a  0.5  chance  of  passing.  This  is  due  to  the  symmetry  of  the 
normal  distribution  and  the  fact  that  we  regard  a  pass  as  a  value  being  greater  than  the  minimum 
requirement  and  a  failure  as  a  value  less  than  the  minimum  requirement.  In  this  way,  we  want  to 
know  when  the  degradation  of  the  suits  has  fallen  to  less  than  a  0.5  probability  of  passing  a  test. 
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which  indicates  the  mean  value  of  the  suits  tested  parameter  is  below  the  minimum  requirement. 
See  Figure  9  for  a  visual  representation. 


Min  Reqmt 
=  True  Mean 


Figure  9.  True  Mean  is  the  Minimum  Requirement  - 
Values  to  Right  Pass,  Values  to  Left  Fail 


Table  5.  Beta  Parameters  and  Delta  Values 


TEST 

PARAMETER  A 

PARAMETER  B 

Delta 

BREAK 

STRENGTH  (W) 

9.8 

0.2 

0.48 

BREAK 

STRENGTH  (F) 

9.9 

0.1 

0.49 

TEAR 

STRENGTH  (W) 

6.4 

3.6 

0.14 

TEAR 

STRENGTH  (F) 

6.4 

3.6 

0.14 

SEAM 

STRENGTH 

9.9 

0.1 

0.49 

SPRAY 

RATING 

8.3 

1.7 

0.33 

DYNAMIC 

ADSORPTION 

7.5 

2.5 

0.25 

CHEMICAL 

ADSORPTION 

9.9 

0.1 

0.49 

In  the  case  of  the  Spray  Rating  test,  the  test  has  a  .826  probability  of  passing,  so  we  set  A 
equal  to  .326.  We  set  A  equal  to  .326  since  .826  -  .5  =  .326.  We  want  to  know  when  the 
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reliability  has  degraded  more  than  .326,  (i.e.,  the  suit  would  then  have  less  than  a  .5  probability  of 
passing  the  test)  which  indicates  the  reliability  is  below  the  minimum  requirement.  A  table  of  the 
selected  values  for  A  and  the  A  and  parameters  of  the  Beta  distribution  for  each  test  is  given  in 
Table  5. 

The  Beta  parameter  A  is  updated  after  every  year  of  testing  by  adding  A  to  the  number  of  tests 
that  pass.  The  beta  parameter  B  is  also  updated  each  year  by  adding  B  to  the  number  of  failures  in 
a  particular  test.  The  new  estimated  value  of  the  probability  of  the  suits  ability  to  pass  a  certain 
test  is  then  (A  +  Passes)  /  (A  +  Passes  +  B  +  Fails).  This  is  a  direct  result  of  the  Beta  distribution 
being  a  natural  conjugate  for  the  Binomial  sampling  distribution  as  given  in  the  literature  review. 

To  find  the  number  of  tests  that  need  to  be  conducted  /it ,  and  the  number  of  successes  needed 
;ct ,  Launer  and  Singpurwalla  show  that  the  following  inequalities  must  be  solved. 


ipj'Ot-j)' 

and 


where  gipi )  is  the  beta  distribution. 

Launer  and  Singpurwalla  then  show  that  these  two  inequalities  can  be  rewritten  as  follows: 


r(  A  +  B)  -  j  +  ^  ^ 

T(A)T(B)p[j)  r(ni+A  +  B) 


53 


r/  4  ir /  R  ^  ^  i 


r(A)r(B) 


J 

j=o^  •'  ^ 


X 


1=0^  d  m=0^  ^  A 


(48) 


>  1-B 


They  claimed  that  both  of  these  inequalities  are  increasing  functions  of  Xt  but  we  proved  this  not  to 
be  true.  Eq(47)  is  an  increasing  function  of  Xi ,  but  Eq(48)  is  not.  This  is  important  because  the 
algorithm  they  gave  for  solving  these  inequalities  requires  this  property.  The  algorithm  had  to  be 
modified  slightly  to  achieve  the  correct  answer.  With  the  correct  algorithm,  the  smallest  values  of 
rit  and  Xt  can  be  found  that  satisfy  die  above  inequalities.  The  FORTRAN  program  for  this 
algorithm  and  the  accompanying  simulation  is  given  in  the  Appendix  A. 


3.9  Simulation 

With  aU  of  the  proposed  sampling  plans  developed,  we  will  mn  various  simulations  to  help 
determine  the  ability  of  each  sampling  method  to  properly  sentence  a  population  of  suits  as  either 
good  or  bad.  The  simulations  are  mn  in  the  simulation  language  SLAM  II,  except  for  the 
Tmncated  Sequential  Sampling  and  Bayesian  plan,  which  are  mn  in  FORTRAN.  Pritsker  gives  a 
nice  introduction  to  SLAM  II  for  the  interested  reader  (Pritsker,  1986). 

To  simulate  the  degradation  of  the  characteristics  of  the  suits,  six  different  WeibuU  survival 
functions  are  used.  The  Weibull  distribution  was  chosen  since  the  shape  of  its  survival  function 
can  be  greatly  varied.  In  the  simulations,  the  variances  are  assumed  to  remain  constant,  while  the 
means  of  the  suits'  characteristics  decrease  in  proportion  to  the  WeibuU  survival  function.  For 
example,  if  a  characteristic  of  a  population  of  suits  starts  out  at  time  0  with  mean  100  and  at  r  =  6 
the  value  of  the  WeibuU  survival  function  was  .88,  then  the  population’s  mean  at  time  r  =  6  for  that 
characteristic  would  be  88.  Figure  10  gives  a  visual  picmre  of  the  survival  functions  for  the  six 
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Figure  10.  Degradation  Graph  of  the  WeibuU  Survival  Functions 


Table  6.  Degradation  Values  from  the  WeibuU  Survival  Functions 


DEG1 

DEG  2 

DEG  3 

DEG  4 

DEG  5 

DEG  6 

YEARS 

1 

0.999 

0.883 

0.78 

0.679 

0.969 

YEARS 

1 

0.998 

mmi 

0.751 

0.654 

YEAR? 

1 

0.993 

■la 

0.723 

0.632 

0.93 

YEARS 

1 

0.983 

0.812 

0.697 

0.613 

0.904 

YEARS 

1 

0.961 

0.789 

0.672 

0.595 

0.873 

YEAR  10 

0.998 

0.921 

0.766 

0.649 

0.578 

0.838 

YEAR  11 

0.993 

0.852 

0.744 

0.628 

0.563 

0.799 

YEAR  12 

0.976 

0.744 

0.723 

0.607 

0.549 

0.757 

YEAR  13 

0.923 

0.596 

0.587 

0.536 

0.711 

YEAR  14 

0.784 

0.68 

0.568 

0.523 

0.664 

YEAR  15 

0.245 

0.66 

0.55 

0.511 

0.614 

YEAR  16 

0.165 

0.11 

0.64 

0.533 

0.5 

0.564 

WEIBULL 

PARAMETERS 

1ST  PARAMETER 

0.065 

0.07 

0.03 

0.035 

0.03 

0.05 

2ND  PARAMETER 

15 

7 

1.1 

0.8 

0.5 

2.5 
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different  Weibull  functions  used.  Table  6  gives  the  values  of  the  six  Weibull  functions  for  the 
twelve  years  that  are  tested,  years  through  five  through  16. 

For  each  sampling  plan  that  is  developed,  including  the  original  sampling  plan,  the  sampling 
plan  is  run  1000  times  for  each  of  the  six  weibull  degradation  functions.  The  exception  to  this  is 
the  Bayesian  plan,  which  was  only  run  100  times  each  due  to  the  extraordinary  long  simulation 
runs. 

Each  of  the  1000  runs  simulate  all  twelve  years  of  the  sample  plans  for  aU  of  the  eleven  tested 
characteristics  given  to  a  suit.  As  mentioned,  each  of  the  runs  are  also  simulated  across  the  six 
weibull  degradation  functions.  Each  run  for  the  original  sampling  plan  consists  of  sampling  either 
10, 20,  or  30  normally  distributed  random  numbers  depending  on  which  test  is  being  given  to  a 
suit.  Likewise,  each  run  for  a  sequential  sampling  plan  samples  as  many  normally  distributed 
random  numbers  as  needed  by  the  sequential  plan. 

The  SLAM  and  FORTRAN  programs  can  be  seen  in  Appendix  A.  Having  shown  how 
our  methodology  has  been  constructed,  we  will  now  explore  the  data  analysis  of  our  simulation 
runs. 


56 


4.  Data  Analysis 


4.1  Introduction 

TWs  chapter  will  examine  the  results  obtained  from  using  the  methodologies  introduced  in 
Chapter  3.  We  will  take  a  general  view  of  the  results  and  identify  which  methodologies  warrant  an 
in  depth  view  of  the  data  obtained  from  the  simulations.  A  measure  of  effectiveness  will  be  derived 
and  applied  to  these  methodologies.  The  results  will  then  be  compared  between  the  various 
methodologies. 

4.2  General  View  of  Data 

The  spreadsheets  that  contain  the  simulation  data  and  guidance  for  reading  them  can  be  found 
in  Appendix  B.  One  of  the  spreadsheets  is  displayed  in  Table  7.  This  particular  spreadsheet 
displays  the  data  obtained  from  simulating  the  Air  Force’s  original  sampling  plan  using  the  second 
degradation  function  and  Condition  I  values.  The  number  accepted  values  in  the  table  represent 
the  number  of  times  that  the  population  of  suits  were  accepted  for  that  particular  characteristic  out 
of  1000  gimnlarinns  The  years  in  which  the  various  characteristics  of  the  suit  first  degrade  helow 
the  mininfiiim  values  are  boxed  in  bold  for  easier  viewing.  One  of  the  first  things  noticed  when 
viewing  the  data,  is  that  the  different  characteristics  degrade  to  the  minimum  values  at  different 
rates.  The  reason  for  this  is  twofold.  First,  the  starting  means  for  the  different  characteristics  start 
at  various  distances  from  their  minimum  values,  and  second,  the  standard  deviations  are  different, 
which  effect  the  rejection  region.  Since  the  characteristics  degrade  at  different  rates,  some  tests 
vnll  reject  the  suits  before  other  tests.  Obviously,  a  nice  quality  one  would  want  in  a  sampling 
methodology  is  its  ability  to  accurately  detect  the  degradation  of  characteristics  in  the  same  year 
that  they  fall  below  the  minimum  requirement. 
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Table  7.  Simulation  Spreadsheet,  Original  Sampling  Plan,  2nd  Degradation  Function 


AF  Sample  Plan  1, 

^eibu 

(.07,  7) 

Min  Value  / 
Reject 
Value 

Mean/ 
Std  Dev 

Year  5 

Year  6 

Year  7 

Year  8 

Year  9 

Year 

10 

Year 

11 

Year 

12 

Year 

13 

Year 

14 

Year 

15 

Year 

16 

TEST 

Degradation 

EE^ 

■uiggl 

■Su 

Break 

190 

HKiSESH 

EgElcl 

■ns 

Khicia 

gyiiial 

181824 

1000 

B^ 

■E] 

bbq 

Ibb] 

0 

HHUH 

l^Hi 

■■ 

j^^B 

■■B 

iBB 

'B^B 

!■■§ 

IB^B 

'l■i 

115 

■ESD 

■HSS 

Kicgfa 

IHLE! 

Iks 

FlUdlfittllUB 

112.079 

IKB] 

■fHH] 

■m 

■ttwd 

bbi 

muim 

MUItiq 

bees 

^BEI 

II^B] 

0 

0 

_ , 

_ , 

Tear 

10 

■DIE 

■Mggi 

■MeVJ 

■EES 

8.946 

7.812 

6.258 

4.41 

2.573 

1155 

mTTiTTilIlkVjl 

9.597 

■EES 

■  yjgg-igi 

MTSIiin 

EWtlil 

■ES 

5 

0 

0 

0 

0 

0 

^■H 

■^1 

■■ 

BBi 

■B 

B^ 

!^■i 

B^l 

^B1 

I^^B 

IBB 

^^B 

7 

■ns 

Mggra 

EEsS 

MMi 

Msa 

i^ita 

6.597 

■EES 

■Bl 

■riliTil 

fSTtin 

behd 

■ES 

bbei 

^bb 

i^^E] 

bbq 

li^Bl 

■■ 

^^B 

{■B 

{^■1 

l^^l 

^BH 

70 

100 

Mean 

99.9 

99.8 

99.3 

98.3 

96.1 

92.1 

85.2 

74.4 

59.6 

42 

strength 

66.997 

10 

#  Accepted 

1000 

1000 

iiinir»i*i 

1000 

1000 

1000 

1000 

1000 

0 

0 

■■ 

■■ 

IBB 

I^Bi 

BUB 

^^B 

■^B 

■B 

B^l 

Mean 

MtlMl 

EUS! 

BEEID 

84.966 

#  Accepted 

1000 

1000 1 

1000 

1000 

1000 

970 

323 

0 

0 

0 

BS 

■■ 

■■ 

bbi 

.Bi^ 

^^B 

■■1 

Spray 

90 

M '  VH 

ESSS 

liiltigfl 

gilKjcl 

ggftia 

BEEJ 

Rating  (L) 

86.64 

■Ttliia 

T«T»!q 

bees 

wmm 

0 

0 

0 

0 

0 

iinnussM 

Mean 

13.71 

13.73 

13.8 

13.93 

14.23 

14.78 

15.73 

17.21 

24.04 

25.89 

25.537 

#  Accepted 

1000 

1000 

1000 

1000 

1000 

1000 

999 

1000 

bees 

BEES 

699 

440 

■■I 

^^B 

bbi 

^Bl 

bbi 

■■ 

bhi 

■^B 

^^B 

■§■ 

20  (max) 

■EE 

Mean 

■EEI 

tarn 

■EES 

MIPM 

■PITiej 

EUH 

Kglg! 

1^^ 

23.69 

#  Accepted 

1000 

10001 

1000 

1000 

1000 

1000 

999 

999 

983 

836 

448 

—m 

■■ 

■■ 

■BB 

■BB 

■BB 

BBI 

^^B 

1.3 

■iss 

Mean 

Elena 

Elchgl 

MEI3n 

■gjilcri 

1132 

IKilR^ 

mum 

MiMtl 

mum 

■BS 

■BE] 

I^^E] 

BBE) 

^■■HH 

■■ 

^Bl 

■■B 

^B1 

B^l 

■■B 

B^l 

Chem(L) 

1.3 

Mean 

2.318 

2.315 

2.304 

2.281 

2.137 

mwtiA 

0.568 

■tlkwa 

Adsorption 

1.118 

0.29 

#  Accepted 

1000 

1000 

1000 

1000 

1000 

1000 

0 

0 

_ 21 

A  point  also  needs  to  be  made  about  differences  between  the  simulations  we  labeled 
Condition  I  and  those  we  labeled  Condition  II.  Recall  Condition  I  simulations  are  those  where  Uo 
is  equal  to  the  Tninimiim  reject  value  and  Ui  is  equal  to  95%  of  the  reject  value.  Condition  II 
simulations  are  those  where  Uo  is  equal  to  the  starting  mean  and  Ui  is  equal  to  the  minimum  reject 
value.  As  mentioned  in  Chapter  3,  the  noticeable  difference  between  these  two  conditions  of 
simulations  is  that  Condition  II  simulations  will  reject  suits  earlier  since  it  has  a  “higher  standard” 
for  accepting  suits.  But  another  difference  that  has  significant  effects  on  the  Type  I  and  II  errors  is 
the  difference  of  the  distances  between  Uo  and  Ui.  The  difference  in  distances  plays  a  significant 
role  in  detennining  how  the  suits  wiU  be  accepted  and  rejected.  In  general,  when  the  distance 
between  Uo  and  Ui  is  large,  it  takes  less  samples  to  meet  the  a  and  p  requirements.  In  general,  we 
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can  expect  to  see  smaller  Type  I  and  II  errors  using  Condition  II  simulations,  since  it  is  easier  to 
distinguish  if  a  population  has  a  mean  of  Uo  or  Ui .  Table  8  shows  the  differences  in  the  distances 
ofUo  andUi . 


Table  8.  Distance  Between  Uo  and  Ui 


CONDITION 

Break 

Strength 

(W) 

Break 

Strength 

(F) 

Tear 

Strength 

(W) 

Tear 

Strength 

(FJ 

Seam 

Strength 

Spray 

Rating 

Dynam. 

Adsorp. 

Chem. 

Adsorp. 

1 

9.5 

5.75 

.5 

.35 

3.5 

4.5 

1 

.065 

II 

56 

44.2 

.5 

.5 

30 

8.17 

6.3 

1.02 

Most  of  the  methodologies  are  accurate  in  rejecting  suits  when  there  is  a  sharp  degradation. 
For  instance  in  Table  9,  the  bold  boxes  in  the  spreadsheet  indicate  the  first  year  in  which  a 
characteristic  degraded  below  the  minimum  requirement.  Since  the  degradation  value  drops 
relatively  quickly  in  years  14, 15,  and  16,  note  how  few  suits  are  accepted  after  a  characteristic  has 
degraded  below  the  minimum  requirement.  On  the  opposing  side,  methodologies  generally  take  a 
few  years  to  reject  a  suit  when  the  characteristic  slowly  degrades  past  the  minimum  requirement. 
An  example  of  this  is  presented  in  Table  10.  Again  the  bold  boxes  indicate  the  first  year  in  which 
a  characteristic  degraded  below  the  minimum  requirement.  Note  that  even  a  few  years  after  a 
characteristic  has  degraded  below  the  minimum  requirement,  the  methodology  still  accepts  many 
suits. 

The  reason  that  many  suits  are  still  accepted  after  the  characteristics  have  degraded  to  the 
minimum  requirement  is  that  the  test  is  set  up  so  that  at  the  minimum  requirement  Uo ,  95%  of  the 
suits  should  stiU  be  accepted.  As  the  suits’  characteristics  degrade  further  below  the  minimum 
requirement,  a  greater  percentage  of  suits  should  be  rejected.  Many  sampling  methodologies 
contain  another  value,  which  we  have  called  Ui ,  with  the  idea  being  that  when  the  actual  mean  of 
the  characteristic  has  degraded  to  Ui ,  we  would  like  to  accept  the  item  being  tested  with  a 
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probability  of  P .  Figure  1 1  gives  an  example  of  an  Operating  Characteristic  (O.C.)  curve 
representing  this  idea.  Calculations  to  obtain  the  O.C.  curve  can  be  found  in  many  statistical 
books  (Kapur  and  Lamberson,  1977).  One  of  our  main  concerns  with  the  Air  Force’s  original 
sampling  plan  is  that  there  is  no  mention  p  or  a  Type  II  error.  We  will  now  look  at  the  data  from 
simulating  various  methodologies  and  observe  how  weU  they  did. 


Table  9.  Example  of  Sharp  Degradation 


AF  Samole  Plan  1.  Weibull  (.065,15) 

Min  Value  / 
Reject 
Value 

Mean/ 
Std  Dev 

Year  5 

Year  6 

Year  7 

Year  8 

Year  9 

Year 

10 

Year 

11 

Year 

12 

Year 

13 

Year 

14 

Year 

15 

Year 

16 

TEST 

Degradation 

MHI 

B^B 

mm 

BHD 

IHD 

Ed^QBI 

KS 

BBS 

g*^kjtd 

gtggin 

■EiaE] 

■EglEI 

Mtm 

mm 

mim 

■Ulild 

BEEE] 

BBEl 

HBEl 

I^HHI 

I^^B 

■■ 

■BB 

^HH 

^■B 

B^H 

B^H 

II^B 

BIB 

BBH 

HBB 

HIHiB 

Misrm 

■KWfel 

KEggl 

■Mli} 

Mtlim 

muim 

Munm 

TtTtin 

■ttlilil 

■hitin 

BHEl 

BI^E] 

B^^ 

IHH 

■BB 

HBI 

BIH 

bbb 

IBH 

I^BB 

BHB 

II^H 

■CSIEH 

■EE 

■EE 

HS 

BP 

9^ 

gNg[:] 

K[«lgg1 

I 

■«itcnkl 

BUg 

■BESS 

mimti 

BBEil 

0 

— ^ 

■■ 

^■B 

B^^ 

jj^^B 

B^H 

B^H 

BIB 

Bi^l 

^HB 

^■B 

^■i 

Tear 

7 

7.5 

IIK^M3B| 

HIES 

■RH 

WBE 

HE 

g^Wcl 

■Efnn 

6.597 

1.342 

f «TiIil 

1000 

1000 

1000 

1000 

1000 

1000 

904 

1 

0 

0 

Seam 

70 

■ES 

WESl 

■EEl 

■DE 

HEEl 

BEaE 

IHSB 

■ri:gi 

50.5 

16.5 

Strength 

Munm 

■hlOl 

mim 

Mttol 

0 

0 

IBB^^HI 

B^H 

bbh 

■BB 

^HH 

^BH 

^^H 

^^H 

Bj^H 

II^B 

BI^B 

■IB 

;^IB 

Spray 

90 

Mtltlfi 

E3B 

gi!:m 

fcTnra 

gtTiro 

gtWEH 

Kyjgrn 

K>MI 

1^^ 

IB^E 

Rating  (UL) 

84.966 

#  Accepted 

mm 

1000 

1000 

1000 

1000 

1000 

979 

4 

0 

— 3 

. 

I^HI 

■■ 

bbh 

BBH 

HBB 

BBB 

!^BB 

Spray 

90 

HE2!I3iBlli 

EHiB 

g?!:m 

mrntyj 

reggrn 

gt!:-f:TI 

IBEIS 

Rating  (L) 

86.64 

Mtlilil 

■Bliig 

Mtium 

■tiHH] 

HE] 

— ^ 

■§■ 

B^H 

H^H 

H^H 

^BIH 

i^BBI 

^BBI 

IB^B 

czsszcm 

■RTilRT^ 

BEES 

■EB 

BOB 

bkb 

■EB 

■EB 

■EE 

gEH»!e1 

■ni^ 

gjjlgl 

mm 

MWtltl 

Ttwn 

Mim 

■rtigiii 

MHUld 

BE^ 

BEI^ 

^EE] 

■BBI 

IHI 

IB^^ 

B^^ 

B^H 

HIH 

HBI 

^BH 

^^B 

l^B 

'B^B 

I  BBH 

CZSSZOH 

H^SSSl 

heb 

■EE 

■EE 

■EB 

■EB 

■EB 

BEE 

Kgfiigi 

■gig! 

Ktgi:l 

mu^Li 

OiESSSIl 

Munm 

mmn 

mum\ 

Mtum 

^EEB 

HH 

I^^B 

BBH 

IHBi 

■■ 

^HH 

^^H 

B^H 

B^B 

BBB 

!■■ 

iHBi 

esssbssm 

mpKci 

MEE3 

IIBS 

IS^ 

mm 

mMLii 

■g:lti] 

MW¥F\ 

»W>H1 

lltlilil 

mm 

MTOIil 

MiHUi 

Mttum 

iHIilil 

Ttigg 

mim 

I — s 

■■ 

I^HI 

B^H 

B^B 

bbb 

BBH 

^^BB 

1.3 

■iSS 

BB|[i2^^3Bli 

MgcEl 

WEES 

gJEiEI 

WMU 

MW:\U\ 

■w^ 

1.118 

Mim 

IMitfl?! 

iMunni 

BEEQ 

^■Q 
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Table  10.  Example  of  Slow  Degradation 
AF  Sample  Plan  II,  Weibull  (.07,7) _ 


Min  Value/ 

Reject  Mean/ 
Value  Std  Dev 


Year  Year  Year  Year  Year  Year  Year 

Years  Year6  Year?  Yearfl  YearQ  10  11  12  13  14  15  16 


4.3  Pre-Posturing  Results 

Recall  that  the  Pre-Posturing  sampling  plan  samples  the  same  amount  of  suits  as  the  original 
sampling  plan.  The  difference  is  that  the  Pre-Posturing  sampling  plan  does  not  sample  the  same 
amount  of  suits  every  year,  but  samples  fewer  suits  in  the  earlier  years  and  more  suits  in  the  later 
years.  We  can  compare  the  results  of  this  methodology  to  the  results  of  the  original  sampling  plan. 

When  we  evaluate  the  results  of  a  sampling  plan,  we  will  treat  each  test  in  each  year  as  an 
independent  event.  The  reason  for  this  is  that  we  do  not  want  to  make  the  decision  to  accept  or 
reject  a  population  of  suits,  but  we  want  to  be  able  to  accurately  find  when  the  suits’ 
characteristics  have  degraded  below  the  minimum  requirement.  We  will  still  use  the  terms  accept 
and  reject  to  indicate  whether  a  methodology  determines  the  characteristic  is  above  or  below  the 
minimum  requirement. 

The  Pre-Posturing  and  original  sampling  plan  have  eight  different  years  where  the  number  of 
suits  sampled  differ  between  the  two  methodologies.  These  are  years  5, 6, 7, 8, 11, 12, 13  and  14. 
With  six  degradation  functions,  and  eleven  different  suit  tests,  these  combinations  of  years, 
degradation  functions,  and  tests  give  us  a  total  of  528  simulation  results  that  can  be  compared 
between  the  two  methodologies.  Simulations  for  the  Pre-Posturing  sampling  plan  were  run  only 
once,  using  Condition  I  values. 

When  comparing  the  two  methodologies,  we  wUl  count  as  significant  only  those  results  that 
have  a  difference  of  5%  or  more  between  the  two  methodologies.  Out  of  the  528  simulations,  only 
43  (8.14%)  of  the  simulations  have  a  significant  difference  of  5%  or  more.  Table  1 1  shows  the 
years  in  which  the  significant  differences  between  the  simulations  occur  and  the  number  of  suits 
accepted  in  those  years.  Note  that  the  data  in  Table  1 1  is  taken  from  all  of  the  simulations  run  for 
the  Pre-Posturing  and  original  sampling  plan  using  Condition  1  values.  It  is  interesting  to  note  that 
aU  of  the  significant  differences  occur  in  the  simulation  runs  where  the  suits  should  be  rejected  and 
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none  of  the  significant  differences  are  in  runs  where  the  suits  should  be  accepted.  We  define  that  a 
suit  should  be  rejected  when  a  characteristic’s  true  mean  degrades  below  the  minimum  requirement 
Uo,  and  a  suit  should  be  accepted  when  a  characteristic’s  tme  mean  is  above  the  minimum 
requirement  Uo. 

Ihe  reason  for  the  differences  between  the  two  methodologies  being  only  in  simulations  where 
the  suits  should  be  rejected  is  that  in  both  methodologies,  the  critical  region  is  fixed  strictly  by  the 
Uo  value  in  terms  of  a  and  the  standard  deviation.  There  is  no  regard  for  Ui  or  p .  Therefore  both 
methodologies  always  insure  that  there  is  only  a  5%  chance  of  rejecting  a  good  suit  without  regard 
to  the  probability  of  accepting  a  bad  suit.  When  the  characteristics’  true  means  are  at  Uo ,  the  Pre- 
Posturing  and  original  sampling  plan  will  both  accept  the  suits  95%  of  the  time,  as  the  results  of 
the  simulations  show  (See  Appendix  B,  AF  and  Pre-Postuiing  Spreadsheets). 

Observing  Table  1 1,  notice  that  the  years  in  which  each  respective  sampling  plan  does  better, 
i.e.,  accepts  less  suits,  can  clearly  be  divided  by  the  years  5  through  8  and  the  years  1 1  through  14. 
The  reason  is  quite  simple.  The  Pre-Posturing  plan  has  more  suits  available  for  sampling  in  the 
later  years  than  does  the  original  sampling  plan,  whereas  the  original  sampling  plan  has  more  suits 
available  for  sampling  in  the  earlier  years  than  does  the  Pre-Posturing  plan.  When  the 
characteristics  degrade  below  their  minimum  requirement,  more  suits  are  needed  to  accurately 
sentence  the  suits  as  reject  or  accept.  Therefore,  as  a  characteristic  degrades  below  its  minimiun 
requirement,  whichever  sampling  plan  has  the  greater  number  of  suits  to  sample  when  this  occurs 
will  reject  a  larger  percentage  of  suits. 

The  whole  concept  behind  the  pre-posture  plan  is  to  “posture”  more  of  the  suits  in  the 
years  where  it  is  believed  the  characteristics  will  degrade  close  to  the  minimum  requirement.  As 
the  trae  mean  nears  the  miitimum  requirement,  more  suits  need  to  be  sampled  to  get  a  decent  p 
value.  In  the  simulations,  many  of  the  suits  degrade  earlier  than  the  pre-posturing  plan  expects  and 
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the  pre-posture  plan  has  fewer  suits  in  the  earlier  years,  thus  resulting  in  greater  errors.  The 
success  of  the  Pre-Posturing  plan  relies  on  fairly  accurate  prior  knowledge  as  to  when  most  of  the 
characteristics  start  to  degrade  below  the  minimum  requirement.  This  plan  will  succeed  only  if  you 
know  in  which  years  to  posture  the  majority  of  the  suits  for  sampling.  In  fact,  not  only  will  the 
plan  not  succeed  if  you  do  not  posture  the  suits  correctly,  but  it  can  do  much  worse  in  its  ability  to 
reject  bad  suits,  as  seen  in  the  bottom  half  of  Table  11. 


Table  11.  Comparison  Between  Pre-Posture  and  Original  Sampling  Plans 


1  Percentages  Of  Suits  Accepted  When  They  Should  Have  Been  Rejected  \ 

\  Simulations  Where 

Pre-Posture  Plan  Fairs  Better  I 

YEAR 

11 

11 

11 

12 

EBB 

13 

13 

13 

13 

13 

EEB 

14 

ill 

Original 

18.6 

32.3 

59.6 

20.5 

39.3 

8.0 

Pre- 

Posture 

11.5 

19.5 

52.2 

13.4 

21.9 

1.5 

YEAR 

14 

14 

14 

14 

Original 

71.3 

38.6 

90.8 

83.6 

Pre- 

Posture 

59.0 

19.4 

85.8 

71.8 

B 

B 

B 

1  Simulations  Where  Ori 

ginal  Sampling  Plan 

Fairs  Better  I 

15 

5 

5 

5 

5 

5 

5 

5 

6 

B 

47.2 

0.9 

68.4 

0.1 

19.2 

BUI 

mil 

93.1 

77.9 

13.0 

46.0 

76.0 

22.9 

86.1 

12.4 

22.2 

IBB 

YEAR 

6 

6 

6 

7 

7 

7 

8 

8 

8 

8 

Original 

26.4 

8.6 

2.3 

34.9 

77.4 

Pre- 

Posture 

64.4 

22.0 

39.6 

80.4 

10.6 

40.8 

9.4 

8.6 

50.4 

83.4 

As  mentioned,  the  whole  purpose  of  the  pre-posture  plan  is  to  use  fewer  suits  in  the  earlier 
years  where  you  believe  the  actual  means  of  the  suits  are  far  away  from  the  minimum  requirement 
and  to  save  those  suits  for  later  years.  A  plan  that  can  statistically  determine  when  less  suits  are 
needed  for  sampling  is  the  sequential  sampling  plan.  We  wiU  now  look  at  the  results  of  the 
Aggregated  Sequential  plan. 
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4.4  Aggregated  Sequential  Results 

The  purpose  of  constructing  the  aggregated  sequential  plan  was  to  deal  with  the  problem  of 
the  different  tests  requiring  different  numbers  of  samples  in  sequential  sampling  to  meet  the  given 
a  and  P  requirements.  In  this  methodology,  recall  that  weights  are  assigned  to  each  of  the  various 
tests.  The  weights  sum  to  one  and  the  heavier  weights  are  given  to  those  tests  that  are  considered 
to  be  more  important  in  deciding  whether  suits  should  be  rejected  or  accepted.  Unlike  the  other 
methodologies  in  this  thesis,  the  Aggregated  Sequential  sampling  method  models  the  decision  of  the 
decision  maker.  This  sampling  plan  takes  into  account  which  tests  the  decision  maker  believes  are 
of  most  important,  and  models  this  by  appropriately  assigning  weights  to  each  of  the  tests.  For  the 
simulation,  we  assigned  the  two  Chemical  Adsorption  tests  weights  of  .25  each.  The  remaining 
nine  tests  were  each  assigned  weights  of  .0556.  These  weights  sum  approximately  to  one. 

The  results  of  the  simulations  for  the  Aggregated  Sequential  plan  are  displayed  in  Table  12. 
Ihe  years  in  which  the  aggregated  sum  first  degrades  below  Uo  are  surrounded  in  bold.  Note  that 
the  average  sample  size  falls  below  6  samples  in  every  year.  The  original  sampling  plan  samples 
30  suits  per  year.  In  the  standard  sequential  plan,  which  is  introduced  in  later  in  Section  4.8, 
average  sample  sizes  range  usually  between  5  and  80  samples  for  each  test  in  any  given  year. 

There  are  two  reasons  for  such  a  reduction  in  sample  size  using  tiie  Aggregated  Sequential 
plan.  The  first  reason  is  that  by  aggregating  all  the  results,  the  variance  is  reduced.  Recall  in 
Section  3.6  after  the  test  results  were  standardized,  all  the  results  were  normally  distributed  with  a 
standard  deviation  of  1.  After  the  tests  are  aggregated,  the  result  is  a  normally  distributed  random 
value  with  a  standard  deviation  of  .391.  In  general,  sequential  sampling  requires  less  samples  to  be 
taken  as  the  standard  deviation  decreases.  The  second  reason  for  the  smaller  sample  size  is  that 
the  chemical  adsorption  tests  are  weighted  the  most  heavily.  The  chemical  adsorption  tests  usually 
require  a  small  sample  size  and  account  for  50%  of  the  weighted  values  of  the  Aggregated  sample 
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plan.  Conversely,  the  other  less  critical  tests  that  require  large  samples  are  weighted  small  enough 
to  have  little  effect  in  increasing  the  sample  size. 


Table  12.  Aggregated  Sampling  Results 


Aggregated  Sequential  Sample  Plan  I,  Weibull  (.065,15) 

. 'rMin'Valuel' . j . . . [" . [’ . . ! . . I . . I . ”1 . . . . 

I  /  Reject  I  Mean/ 1  j  j  i  j  j  j  Year  i  \ 

Value  i Std  Dev i  \ Year 5\ Year 6 iYear7j  YearSi Year9:  10  i 


The  results  show  that  since  the  Chemical  Adsorption  test  is  weighted  the  most  heavily,  the 
chemical  test  naturally  has  the  largest  influence  on  when  to  faU  the  population  as  a  whole.  If  the 
Chemical  Adsorption  test  shows  an  accept  value,  it  usually  takes  a  combination  of  six  to  seven 
other  tests  showing  a  reject  value  in  order  to  reject  the  suit.  These  results  meet  the  objective  of  the 
test:  to  keep  testing  if  the  tests  that  are  deemed  important  still  require  testing,  and  to  take  away  the 
ability  of  smaller  valued  tests  to  keep  requiring  suits  to  be  tested  just  to  satisfy  the  sampling  needs 
of  a  lesser  valued  test.  While  half  the  weight  in  the  Aggregated  Sequential  plan  is  allocated  to  the 
Chemical  Adsorption  tests,  the  Aggregated  accept  /  reject  decision  does  not  exclusively  follow  the 
accept  /  reject  decision  of  the  Chemical  Adsorption  test.  Table  13  compares  the  results  to  accept 
or  reject  the  suits  based  upon  the  Aggregated  Sequential  plan  and  based  upon  the  Chemical 
Adsorption  alone  which  uses  the  standard  sequential  plan,  to  be  discussed  in  Section  4.8.  Note  that 
especially  under  degradation  functions  three,  four,  and  five,  the  Aggregated  Sequential  plan  rejects 
the  suits  as  a  whole  indifferent  to  the  fact  that  the  Chemical  Adsorption  test  signifies  the  chemical 
adsorption  characteristic  is  still  acceptable.  This  is  an  example  of  the  less  critical  tests  combining 
to  influence  the  critical  tests. 

With  this  plan  we  are  assuming  that  the  different  tests  are  independent  of  each  other  in 
determining  the  variance  of  the  aggregated  normal  distribution.  It  is  pretty  unlikely  that  the  waip 
breaking  strength  is  independent  of  the  fill  breaking  strength  and  the  same  for  the  two  tearing 
strength  tests.  At  this  time  we  do  not  have  the  information  available  to  determine  the  covariances 
between  the  tests.  If  we  are  assuming  the  tests  are  independent  but  tiie  tme  tests  actually  have  a 
positive  covariance,  then  we  have  underestimated  the  actual  variance  of  the  aggregated  sum.  If 
the  variance  of  the  aggregated  sum  is  underestimated,  then  the  Aggregated  Sequential  sampling  will 
require  too  few  samples  to  be  drawn.  This  is  because  the  larger  the  variance,  the  larger  the  number 
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of  samples  that  are  needed  to  meet  the  a  and  [3  requirements.  The  result  is  that  if  we  assume 
independence  when  the  tests  are  not  independent,  too  few  samples  will  be  taken  and  the  Type  I  and 
11  errors  will  be  larger  than  we  want. 


Table  13.  Aggregate  Samples  Relative  to  Chemical  Adsorption  Samples 


NUMBER  OF  SAMPLES  ACCEPTED  ( 

Out  of 

1000) 

Test 

Years 

Year  6 

Year? 

Years 

Year  9 

Year 

10 

Year 

11 

Year 

12 

Year 

13 

Year 

14 

Year 

15 

Year 

16 

Degradation 

1000 

1000 

1000 

1000 

0 

0 

One 

Chem  Adsorp. 

■niiiin 

Hiss 

1000 

1000 

1000 

1000 

1 

0 

Degradation 

Aggregate 

TtliM 

1000 

1000 

1000 

1000 

1000 

998 

4 

0 

0 

Two 

Chem  Adsorp. 

1000 

1000 

»t«r»rii 

1000 

1000 

1000 

1000 

0 

0 

0 

Degradation 

Aggregate 

1000 

1000 

1000 

999 

995 

936 

731 

397 

113 

Three 

Chem  Adsorp. 

1000 

1000 

TitiM 

MWM 

1000 

1000 

1000 

1000 

1000 

1000 

Degradation 

Aggregate 

998 

992 

905 

567 

237 

54 

10 

0 

0 

0 

0 

Four 

Chem  Adsorp. 

1000 

1000 

mimt} 

1000 

1000 

1000 

987 

717 

98 

Degradation 

Aggregate 

693 

307 

69 

7 

3 

0 

0 

0 

0 

Five 

Chem  Adsorp. 

1000 

1000 

1000 

1000 

1000 

998 

974 

675 

145 

25 

3 

_ B 

Aggregate 

msM 

1000 

1000 

1000 

1000 

1000 

999 

979 

463 

24 

_ 

Chem  Adsorp. 

1000 

1000 

1000 

1000 

1000 

1000 

1000 

1000 

1000 

mMmi 

The  Aggregate  Sequential  plan  is  a  promising  plan.  As  shown  by  the  simulation,  few  samples 
are  needed  in  order  to  satisfy  the  ot  and  p  requirements.  The  cost  of  these  fewer  samples  is 
determining  and  assigning  the  weights  to  be  assigned  to  the  tests.  This  is  a  decision  left  to  the 
decision  maker.  Also,  the  assumption  that  the  tests  are  independent  may  result  in  too  few  samples 
being  drawn  if  covariance  actually  exists.  As  the  real-life  results  of  sampling  become  available,  it 
may  be  possible  to  determine  the  covariance  between  the  test  results.  Since  this  sampling  plan 
assumes  which  tests  are  important  from  a  decision  makers  point  of  view,  the  tests  cannot  be  viewed 
independently  of  each  other.  It  is  for  this  reason  that  we  will  not  compare  how  accurately  the  plan 
accepts  or  rejects  suits  compared  to  the  other  sampling  plans  in  which  the  tests  are  treated 
independently. 
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4.5  Measure  of  Effectiveness 

Before  we  evaluate  the  results  of  the  four  remaining  methodologies  we  wiU  present  a  measure 
of  effectiveness  to  measure  the  results.  The  first  two  methodologies  presented,  the  Pre-Posturing 
and  Aggregated  Sequential  plans,  will  not  be  evaluated  with  the  measure  of  effectiveness.  The 
reason  for  this  is  that  the  sequential  methodologies  accomplish  what  the  pre-postuiing  plan  tries  to 
do:  take  the  most  samples  where  they  are  needed.  We  have  shown  that  the  success  of  the  Pre- 
Posturing  methodology  is  very  dependent  on  accurately  forecasting  which  years  the  suits  will  start 
to  degrade  below  the  minimum  requirements.  If  we  knew  which  years  the  suits  would  start  to 
degrade,  we  would  not  need  sampling  at  all.  The  Aggregated  Sequential  plan  is  not  evaluated  with 
a  measure  of  effectiveness  since  this  plan  is  different  in  and  of  itself.  This  is  the  only  plan  where 
the  tests  combine  to  reject  or  accept  a  suit  and  the  only  plan  that  takes  into  account  the  decision 
maker’s  thoughts  in  assigning  the  relative  importance  to  the  various  tests. 

We  need  a  measure  of  effectiveness  that  reflects  the  performance  of  the  various 
methodologies,  yet  is  simple  enough  for  a  non-statistician  to  understand.  Our  first  thought  was  to 
assign  a  cost  function  to  the  results.  This  proves  to  be  get  too  unwieldy  for  the  scope  of  this  thesis, 
since  we  would  not  only  need  the  costs  for  taking  samples,  but  also  the  costs  and  probabilities  of 
making  a  Type  I  and  Type  II  error.  Costs  that  would  need  to  be  determined  include  the  cost  of  an 
Air  Force  member’s  death  and  the  cost  of  the  remairting  life  of  a  suit  if  it  were  rejected  too  early. 
We  would  also  need  the  probabilities  of  chemical  attacks  in  a  given  length  of  time  and  the 
probability  that  an  airman  involved  in  a  chemical  attack  was  wearing  a  degraded  suit.  This 
measure  of  effectiveness  would  prove  to  rely  too  much  on  the  accuracy  of  the  cost  function  as 
opposed  to  the  actual  performance  of  the  methodologies. 

The  measure  of  effectiveness  we  chose  to  use  is  one  that  measures  the  accuracy  of  the 
methodology  in  correctly  rejecting  a  suit  when  a  characteristic  has  degraded  below  the  minimum 
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requirement.  This  is  accomplished  by  finding  the  percentage  of  suits  that  were  correctly  rejected  in 
the  same  year  the  suit  degraded  past  the  minimum  requirement.  The  measure  of  effectiveness  also 
finds  the  percentage  of  suits  that  were  rejected  earlier  than  they  were  supposed  to  and  which 
methodologies  rejected  suits  after  the  suits  already  passed  below  the  minimum  requirement. 

We  get  our  measure  of  effectiveness  by  finding  the  years  in  which  the  characteristics  degrade 
below  their  minimum  requirements.  We  then  find  the  percentage  of  suits  rejected  in  the  same  years 
as  when  the  characteristics  degraded.  We  regard  this  percentage  as  the  percentage  of  suits  rejected 
correctly.  Similarly,  we  find  the  percentage  of  suits  that  were  rejected  one  year  early,  one  year 
late,  two  years  early,  two  years  late,  etc.  For  each  methodology  we  find  the  average  percentages 
for  the  different  characteristic  tests.  We  give  a  measure  of  effectiveness  percentage  that  is  the 
average  of  how  the  tests  faired  in  accurately  rejecting  a  characteristic  that  has  degraded  below  the 
minimum  requirement.  See  Tables  14  and  15  for  the  measure  of  effectiveness  results. 

For  the  measure  of  effectiveness,  the  greater  percentage  of  suits  that  were  rejected  in  the  Year 
Failed  column,  the  more  success  the  methodology  has  in  correctly  rejecting  suits  the  same  year  a 
characteristic  falls  below  its  minimum  requirement.  In  Tables  14  and  15,  the  successful 
methodologies  are  the  ones  in  which  the  majority  of  the  suits  are  rejected  in  the  year  the 
characteristic  fails.  For  example,  in  Table  15,  the  Truncated  Sequential  result  in  Degradation 
Function  5  is  highly  successful,  rejecting  73.07%  of  the  suits  in  the  correct  year.  Conversely,  the 
methodologies  which  are  not  as  successful  are  those  in  which  many  suits  are  rejected  several  years 
early  or  several  years  late.  An  example  of  this  is  the  Truncated  Sequential  plan  in  Table  14  in 
Degradation  Fimction  6,  where  over  41  %  of  the  suits  were  rejected  over  four  years  late.  The 
Average  Number  of  Samples  column  gives  the  average  number  of  samples  used  over  the  lifetime  of 
the  suit  for  that  particular  sampling  plan. 
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4.6  Attribute  Bayesian  Results 

The  Bayesian  methodology  does  not  fare  as  well  as  we  had  hoped.  In  Table  14  under  the 
Year  Failed  column,  the  Bayesian  results  show  that  the  Bayesian  sampling  plan  is  not  effective  in 
rejecting  suits  after  they  have  failed.  Compared  to  the  other  sampling  plans,  the  Bayesian  is  the 
least  effective  in  all  six  degradation  functions.  The  results  under  the  Years  Late  columns  show  that 
the  Bayesian  methodology  always  lags  behind  the  other  methodologies  in  being  able  to  property 
reject  a  degraded  suit.  The  Bayesian  methodology  clearly  does  not  do  as  well  as  the  other 


methodologies. 


Table  14.  Measure  of  Effectiveness  Results,  Condition  II  Simulation 


PERCENTAGE  OF  SUITS  METHODOLOGIES  REJECTED,  CONDITION  II  SIMS 


METHOD 

>4 

Years 

Early 

4 

Years 

Early 

3 

Years 

Early 

2 

Years 

Early 

1 

Year 

Early 

Year 

Failed 

1 

Year 

Late 

2 

Years 

Late 

3 

Years 

Late 

4 

Years 

Late 

>4 

Years 

Late 

Awg#of 

Samples 

ORIGINAL 

13.4 

4.05 

3.68 

BEn 

Blifil 

9.32 

6.64 

2.01 

0 

360 

SEQUENTIAL 

6.86 

2.08 

2.01 

1.85 

3.24 

9.35 

23.5 

37.6 

13.2 

0.29 

0 

331.78 

TRUNC.  SEQ. 

9.44 

2.32 

2.2 

2.35 

2.64 

5.47 

17.5 

45.7 

10.7 

1.36 

0 

266.24 

BAYESIAN 

2.86 

1 

1.29 

1.14 

1.57 

2.43 

18.7 

26.1 

43.6 

1.29 

0 

321.47 

DEGRADATION  FUNCTION  2 

ORIGINAL 

0 

0 

5.22 

13.7 

26.6 

■ai;m 

360 

SEQUENTIAL 

0 

0 

2.59 

■B 

^Bil 

263.9 

TRUNC.  SEQ. 

0 

0 

4.09 

KBl 

nil 

■1^ 

Bicll 

202.207 

BAYESIAN 

0 

0 

1.71 

0 

wm 

1.43 

gggl 

16.7 

233.14 

0 

0 

0 

■jK] 

MtmA 

360 

SEQUENTIAL 

0 

0 

0 

0 

0 

5^ 

204.87 

TRUNC.  SEQ. 

0 

0 

0 

0 

0 

109.5 

BAYESIAN 

0 

0 

0 

0 

0 

30.4 

0 

15.3 

4.29 

3.86 

37.1 

151.26 

DEGRADATION  FUNCTION  4  I 

ORIGINAL 

0 

0 

0 

0 

0 

87.3 

4.15 

3.12 

2.21 

1.32 

1.94 

360 

SEQUENTIAL 

0 

0 

0 

0 

0 

83.3 

10.3 

4.26 

1.51 

0.43 

0.15 

207.2 

TRUNC.  SEQ. 

0 

0 

0 

0 

0 

85.6 

8.56 

3.55 

1.38 

0.52 

0 

65.03 

BAYESIAN 

0 

0 

0 

0 

0 

53.1 

10.3 

9.86 

8.14 

4 

14.6 

125 

DEGRADATION  FUNCTION  5 

ORIGINAL 

0 

0 

0 

0 

0 

90.9 

4.59 

2.18 

1.18 

0.55 

0.57 

360 

SEQUENTIAL 

0 

0 

0 

0 

0 

94.6 

4.47 

0.76 

0 

0 

0 

181.2 

TRUNC.  SEQ. 

0 

0 

0 

0 

0 

93.9 

3.67 

1.42 

0.59 

0 

0 

51.86 

BAYESIAN 

0 

0 

0 

0 

0 

83.6 

5.43 

2.43 

1.14 

1.57 

5.86 

176.61 

DEGRADATION  FUNCTION  6  I 

ORIGINAL 

0 

0 

0 

0 

0 

34.7 

28 

16.6 

6.58 

2.77 

11 

360 

SEQUENTIAL 

0 

0 

0 

0 

0 

18.7 

17.5 

10 

9.07 

12.7 

32 

230.16 

TRUNC.  SEQ. 

0 

0 

0 

0 

0 

15.8 

16 

4.03 

8.97 

13.9 

41.3 

175.26 

BAYESIAN 

0 

0 

0 

0 

0 

3 

5.57 

15 

12.6 

6.43 

54.7 

185.22 
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Table  15.  Measure  of  Effectiveness  Results,  Condition  I  Simulation 


PERCENTAGE  OF  SUITS  METHODOLOGIES  REJECTED, 

CONDITION  1  SIMS 

>4 

4 

3 

2 

1 

1 

2 

3 

4 

>4 

Years 

Years 

Years 

Years 

Year 

Y  ear 

Failed 

Year 

Years 

Years 

Years 

Years 

Avg  #of 

METHOD 

Early 

Early 

Early 

Early 

Early 

Late 

Late 

Late 

Late 

Late 

Samples 

DEGRADATION  FUNCTION  1  I 

ORIGINAL 

0 

0 

0 

0 

0.56 

58.6 

34 

0 

0 

0 

0 

360 

SEQUENTIAL 

0 

0 

0 

0 

0 

76.7 

23.8 

0 

0 

0 

0 

1017.2 

TRUNC.  SEQ. 

0 

0 

0 

0 

0.57 

76.3 

17.4 

0 

0 

0 

0 

282.48 

DEGRADATION  FUNCTION  2 

ORIGINAL 

0 

0 

0 

0 

0.98 

60.9 

27.5 

7.54 

0 

0 

0 

360 

SEQUENTIAL 

0 

0 

0 

0 

0 

82.1 

17.5 

0 

0 

0 

0 

783.4 

TRUNC.  SEQ. 

0 

0 

0 

0 

2.48 

72.4 

19.8 

2 

0 

0 

0 

256.1 

DEGRADATION  FUNCTION  3 

ORIGINAL 

0 

0 

0 

0 

1.07 

44.9 

36.3 

9.43 

1.42 

0 

0 

360 

SEQUENTIAL 

0 

0 

0 

0 

0.81 

65 

22.3 

3.09 

0 

0 

0 

803.5 

TRUNC.  SEQ. 

0 

0 

0 

0.63 

3.06 

60.3 

22.5 

4.68 

0 

0 

0 

343.84 

DEGRADATION  FUNCTION  4 

ORIGINAL 

0.71 

0 

0 

0.57 

1.56 

47.7 

13.7 

5.41 

0.59 

0 

0 

360 

SEQUENTIAL 

0 

0 

0 

0 

1.23 

58.1 

26.2 

2.28 

0 

0 

0 

1920.8 

TRUNC.  SEQ. 

2.42 

0.91 

0.82 

1.62 

3.68 

57.4 

22.6 

4.31 

0.18 

0 

0 

355.72 

DEGRADATION  FUNCTION  5 

ORIGINAL 

0.87 

0 

0 

0.67 

1.23 

59.5 

8.41 

5.31 

4.38 

2.67 

0 

360 

SEQUENTIAL 

0 

0 

0 

0 

0.58 

71.2 

17.2 

4.17 

3.11 

0 

0 

3553.9 

TRUNC.  SEQ. 

2.64 

0.97 

1.07 

1.68 

2.94 

73.1 

11.3 

3.49 

1.16 

0 

0 

349.62 

DEGRADATION  FUNCTION  6 

ORIGINAL 

0 

0 

0 

0 

1.6 

22 

29.5 

9.92 

2.77 

3.84 

2.64 

360 

SEQUENTIAL 

0 

0 

0 

0 

1.18 

34.1 

34.7 

2.27 

7.24 

4.29 

0 

901.4 

TRUNC.  SEQ. 

0 

0 

0 

1.38 

5.81 

34.2 

33.2 

4.86 

4.06 

4.04 

0.82 

348.13 

While  the  Bayesian  method  does  use  prior  knowledge  of  the  probability  that  a  suit  will  pass  a 
test  and  uses  the  information  gained  from  earlier  tests,  too  much  information  is  lost  in  converting 
the  variable  results  of  the  tests  into  attribute  (pass  /  fail)  results.  It  throws  away  the  information 
that  variable  results  give  in  the  distance  away  a  test  result  is  from  its  minimum  requirement.  The 
Bayesian  prior  information  is  not  enough  to  make  up  for  the  lost  variable  information.  Since  the 
results  are  clearly  inferior  to  the  other  three  methodologies,  the  Bayesian  methodology  is  not 
simulated  under  the  Condition  I  simulations. 
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To  be  fair  to  the  Bayesian  methodology,  it  should  be  reminded  that  we  originally  thought  the 
data  would  be  attribute  pass  /  fail  data.  When  we  discovered  that  our  data  would  be  variable  test 
data,  we  decided  to  try  the  attribute  Bayesian  methodology  anyway.  Appendix  C  gives  the  results 
of  a  comparison  we  did  in  which  we  compared  the  Bayesian  attribute  plan  to  the  original  sampling 
plan  in  which  the  original  sampling  plan  also  used  attribute  data.  The  results  show  that  the 
Bayesian  plan  still  does  not  fare  well  when  compared  to  other  methodologies  that  also  use  attribute 
data. 

After  reviewing  the  results  in  which  the  Bayesian  algorithm  did  not  perform  as  well  as  the 
original  sampling  plan  using  attribute  data,  we  studied  the  Bayesian  methodology  deeper  in  depth. 
We  feel  that  the  algorithm  in  which  the  Bayesian  is  used  is  not  appropriate  in  detecting  the  suits’ 
degradations.  The  reason  for  this  is  that  the  hypothesis  test  is  set  up  so  that  the  current  reliability 
of  the  suit  is  always  accepted  with  a  probability  of  1-  a .  Each  year  that  the  suits  degrade,  the 
believed  reliability  of  the  suits  is  updated.  The  suits,  with  their  updated  reliability,  still  have  a  1- 
a  probability  of  being  accepted,  even  though  their  reliability  has  degraded.  So  as  the  suits 
degrade,  the  probability  of  an  individual  suit  passing  a  test  decreases.  However,  the  probability 
that  the  population  will  be  accepted  at  the  new  updated  reliability  stays  fixed  at  1-  a .  Perhaps  a 
new  Bayesian  methodology  that  utilizes  variable  data  in  determining  when  a  system  has  degraded 
below  a  minimum  requirement  would  be  useful. 

4.7  Standard  Sequential  Results 

The  standard  sequential  methodology  allows  each  test  to  run  as  many  tests  as  needed  to 
satisfy  the  a  and  |3  requirements.  Ihe  average  number  of  samples  used  in  the  lifetime  of  each  test 
is  given  along  with  the  percentage  of  suits  rejected  in  Tables  14  and  15.  Note  that  in  the  Condition 
1  simulations,  where  the  Uo  and  Ui  values  are  closer  together,  more  samples  are  needed  to  meet  the 
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cc  and  P  requirements.  In  Degradation  Function  5  of  the  Condition  I  simulation,  this  sampling 
method  averaged  3554  samples  throughout  the  lifetime  of  the  suits.  Qearly,  3554  samples  would 
destroy  more  suits  than  the  Air  Force  is  willing  to  part  with.  While  the  sequential  methodology  is 
infeasible  to  run  because  of  the  high  number  of  samples  required  by  some  of  the  tests,  it  gives  good 
information  on  the  number  of  samples  needed  to  satisfy  the  most  demanding  a  and  P  requirements 
on  the  various  tests.  The  results  of  the  measure  of  effectiveness  also  give  a  good  benchmark  from 
which  to  gauge  the  success  of  the  other  methodologies. 

When  the  standard  sequential’s  results  in  Tables  14  and  15  are  compared  to  the  Truncated 
Sequential’s  (Section  4.9)  results,  it  is  noticed  that  the  large  differences  in  sample  sizes  result  in 
relatively  little  improvement  in  being  able  to  reject  a  suit  in  the  correct  year.  For  example,  the 
average  sample  sizes  over  all  degradation  functions  and  Condition  values  for  the  standard 
sequential  and  Truncated  Sequential  are  867  and  234,  a  difference  of  633  suits.  The  average 
percentage  of  suits  rejected  correctly  is  54.5  and  52.4  respectively.  The  difference  of  633  suits 
gives  only  a  2. 1  percentage  point  increase  in  accuracy.  This  is  certainly  not  an  effective  way  of 
sampling  suits. 

As  mentioned  earlier,  the  problem  with  unbounded  sequential  sampling  is  that  some  test  may 
always  be  lingering  in  the  “continue  sampling’’  zone  of  the  sequential  test,  pushing  up  the  number 
of  samples,  while  all  other  tests  may  have  made  their  accept  or  reject  decision  after  a  few  samples. 
This  is  the  inherent  problem  with  sequential  sampling.  After  evaluating  the  results  for  the  original 
sampling  plan  in  the  next  section,  we  will  evaluate  the  Tmncated  Sequential  sampling,  which  puts 
an  upper  limit  on  the  number  of  suits  to  sample  using  sequential  sampling. 
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4.8  Original  Sampling  Plan  Results 

The  original  sampling  plan,  as  mentioned  earlier,  is  designed  to  meet  the  a  specifications  of 
the  tests  only.  It  is  not  responsive  to  the  P  requirements  at  aU.  Therefore,  this  methodology 
always  insures  that  it  accepts  the  suits  with  a  .95  probability  when  the  actual  characteristics’ 
means  are  at  the  Uo  level.  This  gives  this  test  the  ability  to  be  very  successful  in  not  rejecting  the 
suits  too  early.  This  is  a  characteristic  that  the  contractor  would  like  very  much.  Unfortunately, 
the  downside  is  that  since  no  attention  is  given  to  the  p  value,  this  methodology  could  be  late  in 
rejecting  a  suit.  This  methodology  may  give  the  indication  that  the  suit  is  good  when  it  actually 
should  be  rejected,  and  this  could  occur  more  often  than  the  Air  Force  should  be  comfortable  with. 

Table  16  shows  the  theoretical  p  values  for  the  original  sampling  plan  for  both  Condition  I 
and  Condition  II  values  of  the  tests.  For  instance,  with  the  Warp  Break  Strength,  p  is  .398  and  0 
for  Condition  I  and  II  values  respectively.  This  is  saying  that  when  Waip  Break  Strength  is  at  its 
actual  Ui  value,  the  suits  will  be  rejected  39.8%  of  the  time  for  Condition  I  values  and  0%  of  the 
time  for  Condition  II  values.  The  Air  Force  would  like  ihe  suits  to  be  rejected  10%  of  the  time 
when  the  actual  mean  of  the  Warp  Break  Strength  is  Ui.  Note  that  the  Beta  values  in  Condition  II 
are  smaller  than  those  from  Condition  I.  The  reason  for  this  as  mentioned  at  the  beginning  of  this 
chapter  is  that  the  distance  between  the  Uo  and  Ui  values  is  much  greater  in  the  Condition  II 
values.  When  viewing  Table  16  recall  that  L  and  UL  stand  for  Laundered  and  Unlaundered  and 
that  W  and  F  stand  for  Warp  and  Fill. 

The  fact  that  the  beta  values  have  such  small  values  in  some  of  the  tests  with  the  Condition  II 
values  has  a  significant  effect  on  the  measure  of  effectiveness  values  for  Condition  II  simulations. 
Table  14  shows  that  in  most  cases,  the  original  sampling  plan  rejects  significantly  more  suits  than 
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do  the  other  methodologies  in  the  Year  Failed  column.  As  a  whole,  the  original  sampling  plan 
rejects  suits  in  a  more  timely  matter  than  do  the  other  methodologies  in  Condition  II  simulations. 


Table  16.  Theoretical  Beta  Values 


T rue  Value  for  Bela  H 

TEST 

Conditon  1 

Condition  ii 

Break 

gKQuszni 

0.398 

0 

Break 

Strength  (F) 

0.060 

0 

Tear 

.348 

.343 

iTear 

.585 

.348 

iSeam 

.274 

0 

m:mssam 

.282 

.011 

1  Spray 

.574 

.140 

.888 

.118 

ESHSnOI 

.916 

.403 

bSHBUlIl 

.761 

0 

tsBSEsnia 

.824 

0 

For  Condition  I  simulations,  a  different  picture  appears  for  the  original  sampling  plan,  hi 
Condition  I  simulations,  the  original  sampling  plan  rejects  suits  in  a  less  timely  matter  than  do  the 
other  two  plans.  Whereas  very  few  suits  are  rejected  early,  more  suits  are  rejected  late, 
comparative  to  the  other  two  methodologies. 

The  reason  for  the  differences  has  to  do  with  the  Beta  values.  With  the  Condition  I  values, 
the  actual  Beta  values  are  relatively  Mgh,  so  with  the  fixed  sampling,  we  expect  to  see  more  suits 
being  rejected  later  than  they  should  be.  On  the  other  hand,  with  tiie  Condition  II  values,  the  actual 
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Beta  values  are  fairly  low  and  very  close  to  zero  for  some  of  the  tests.  When  the  actual  Beta  values 
are  below  the  required .  10  value,  over-sampling  occurs.  The  decision  maker  was  willing  to  take  a 
Beta  risk  of .  10,  but  is  now  paying  for  a  smaller  Beta  risk  than  he  needs.  This  is  a  problem  with 
fixed  sample  methodologies.  The  methodologies  are  usually  devised  to  satisfy  either  a  or  p ,  but 
not  both.  In  this  sampling  plan,  the  methodology  will  satisfy  the  a  requirement,  but  whether  it 
satisfies  the  P  requirement  depends  on  how  far  the  value  Ui  is  from  Uo.  We  feel  that  it  is 
important  to  be  able  to  control  the  Type  II  error. 

4 .9  Truncated  Sequential  Results 

The  truncated  sequential  sampling  results  appear  to  be  the  most  promising.  By  truncating  the 
tests  earlier,  there  is  some  reduction  in  accuracy  when  compared  to  the  standard  sequential  plan. 

In  section  4.7,  it  was  shown  that  when  using  the  Truncated  Sequential  plan  instead  of  the  standard 
sequential,  the  average  reduction  of  633  samples  decreased  the  percentage  of  suits  rejected  in  the 
correct  year  by  only  2.1  percentage  points.  The  case  in  which  the  largest  difference  of  values  in 
the  Year  Failed  column  occurred  was  10  percentage  points,  which  was  in  Degradation  Function  2 
in  Condition  I  sampling.  All  other  differences  between  the  Year  Failed  columns  are  less  than  five 
percentage  points,  and  many  of  them  have  less  than  one  percentage  point  in  difference.  These  are 
certainly  small  percentages  to  pay  when  being  able  to  control  and  reduce  the  sequential  sample 
size. 

When  comparing  the  truncated  sequential  to  the  original  sampling  plan,  the  original  sampling 
plan  performs  better  in  the  Condition  II  simulations  and  the  Truncated  Sequential  performs  better 
in  the  Condition  I  simulations.  The  reason  for  this  difference  is  largely  due  to  the  difference  of  the 
distances  between  Uo  and  Ui  for  Condition  I  and  II  values.  When  actual  real  world  sampling  takes 
place,  a  reasonable  choice  for  Uo  and  Ui  will  fall  between  the  two  sets  of  values  of  Uo  and  Ui 
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used  in  the  simulations.  The  two  sets  of  values  chosen  for  the  simulations  are  at  the  wide  ends  of 
the  lower  and  upper  scale  for  distance  between  the  Uo  andUi  values.  This  being  the  case,  we  set 
up  another  set  of  values,  which  we  will  caU  Condition  III  values,  that  more  closely  model  the  Uo 
and  Ui  values  that  will  occur  in  actual  sampling. 

The  value  chosen  for  Ui  will  be  equal  to  the  minimum  requirement  declared  for  each  test  by 
the  Air  Force.  This  is  the  same  value  for  Ui  as  it  was  in  Type  II  sampling.  Our  choice  for  Uo  is 
half  the  distance  between  Ui  and  the  starting  mean  for  the  test.  The  distance  between  Uo  and  Ui  is 
now  exactly  half  the  distance  as  it  was  in  Type  II  simulations.  These  values  can  be  put  into  words 
as  saying  “when  the  characteristics  degrade  from  their  starting  means  to  halfway  to  the  minimum 
requirement,  we  want  to  accept  them  95%  of  the  time,  when  the  characteristics  have  degraded  to 
their  minimum  requirement,  we  want  to  accept  them  only  10%  of  the  time”.  The  simulations  were 
run  only  for  the  Truncated  Sequential  and  the  Original  sampling  plans.  The  results  for  the 
measurement  of  effectiveness  for  Condition  HI  simulations  are  in  Table  17. 

Table  17  shows  the  results  are  fairly  equal  for  the  percentages  of  suits  rejected  correctly.  To 
get  a  clearer  idea  of  how  the  two  methods  compare,  we  averaged:  1)  the  percentage  of  suits 
rejected  correctly;  and  2)  the  percentage  of  suits  that  were  rejected  correctly,  plus  or  minus  one 
year.  Tire  average  percentages  of  suits  rejected  in  the  correct  year  were  50.43%  and  50.23%  and 
the  average  percentage  of  suits  that  were  rejected  within  one  year  of  the  correct  year  were  75.46% 
and  74.14%,  for  the  original  and  Truncated  Sequential  sampling  plans  respectively.  The 
differences  are  so  small  that  they  are  negligible.  However,  the  Truncated  Sequential  plan  is  able  to 
get  this  information  using  considerably  fewer  number  of  suits.  The  Truncated  Sequential  plan 
averages  205  suits  over  the  lifetime  of  the  suits  while  the  original  sampling  plan  always  uses  360 
suits.  Using  the  values  for  Condition  III  simulations,  the  two  methodologies  appear  to  be  equal  in 
accuracy,  but  not  efficiency. 
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The  two  advantages  that  the  Truncated  Sequential  sampling  plan  has  over  the  oii^nal  plan  is 
that  it  is  able  to  control  its  Type  11  errors  in  addition  to  its  Type  I  errors  and  the  Tmncated 
Sequential  plan  reduces  the  number  of  samples  used.  The  original  sampling  plan  averaged  360 
suits,  whereas  the  Truncated  Sequential  plan  averaged  about  205  suits.  This  is  a  difference  of  155 
suits  on  average. 


Table  17.  Measure  of  Effectiveness,  Condition  III  Simulations 


PER< 

:entage  of  suits 

METHODOLOGIES 

REJECTED,  CONDITION 

II  SIMS 

>4 

4 

3 

2 

1 

1 

2 

3 

4 

>4 

Years 

Years 

Years 

Years 

Year 

YGar 

Year 

Years 

Years 

Years 

Years 

Avg#of 

METHOD 

Early 

Early 

Early 

Early 

Early 

ral  lea 

Late 

Late 

Late 

Late 

Late 

Samples 

DEG 

RADA1 

riON  FUNCTI 

ONI 

ORIGINAL 

0.49 

0.14 

0.11 

0.24 

1.88 

68 

27.3 

0.06 

0 

0 

0 

360 

TRUNC.  SEQ. 

3.19 

0.74 

0.96 

1.3 

5.13 

66 

21.9 

0 

0 

0 

0 

298.56 

DEGRADATION  F 

UNCTION  2 

ORIGINAL 

0.07 

0.16 

0.09 

0.23 

1.56 

31.4 

46.7 

13.5 

4.94 

1.25 

0 

360 

TRUNC.  SEQ. 

0.09 

0.92 

0.93 

1.49 

3.78 

34.6 

50.3 

6.28 

1.24 

0.31 

0 

243.21 

DEGRADATION  FUNCTION  3 

ORIGINAL 

0.2 

0.28 

0.31 

0.51 

1.06 

38.2 

22.3 

15.6 

6.46 

2.91 

360 

SEQUENTIAL 

TRUNC.  SEQ. 

0.45 

0.54 

0.62 

1.33 

39.3 

6.32 

13.3 

17.8 

11 

210.25 

DEGRADA1 

nONF 

UNCTION  4 

ORIGINAL 

0 

0 

0 

1.35 

67.7 

8.3 

8.58 

TRUNC.  SEQ. 

0 

0 

0 

1.8 

1.8 

66.8 

12.4 

5.76 

5.41 

4.52 

3.27 

_ 

DEGRADATION  FUNCTION  5  | 

ORIGINAL 

0 

0 

0 

0 

0 

81.4 

4.35 

2.19 

2.14 

1.96 

7 

TRUNC.  SEQ. 

0 

0 

0 

0 

0 

83.6 

8.52 

4.53 

Miai 

0.81 

0.83 

DEGRADATION  FUNCTION  6  j 

ORIGINAL 

0.12 

0.12 

0.14 

0.27 

1.36 

15.8 

35.4 

24.6 

10.2 

3.34 

0 

360 

TRUNC.  SEQ. 

0.19 

0.52 

0.59 

3.88 

11.1 

28.1 

42 

11 

1.41 

0 
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4.10  Summary 

We  have  reviewed  and  analyzed  the  data  and  derived  results  for  each  methodology.  The 
Pre-posturing  method  was  shown  not  to  be  superior  to  the  original  sampling  plan  and  the 
Aggregated  sequential  plan  was  shown  to  have  considerably  reduced  the  number  of  samples  drawn 
due  to  its  small  variance.  The  Aggregated  sequential  plan  also  made  the  assumption  that  the  tests 
are  independent  of  each  other.  The  Bayesian  plan  was  shown  to  have  been  disappointingly 
unsuccessful  and  the  standard  sequential  plan  needed  too  many  samples  to  be  feasible.  This  left  us 
with  the  original  sampling  plan  and  the  Truncated  sequential  plan.  These  last  two  sampling  plans 
performed  equally  as  well  according  to  the  simulation  results  and  measure  of  effectiveness.  In  the 
next  chapter,  we  will  give  our  conclusions  and  recommendations. 
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5.  Conclusion 


5.1  Findings  of  Sampling  Methodologies 

The  purpose  of  this  thesis  was  to  find  a  sampling  methodology  that  would  monitor  the  shelf- 
life  of  the  Air  Force’s  Chemical  Defense  coverall.  The  goal  was  to  find  a  methodology  that  would 
be  able  to  effectively  detect  the  degradation  in  quality  of  the  suits,  while  at  the  same  time  minimize 
the  number  of  samples  needed  to  accurately  do  this.  Based  on  the  results  of  the  study  and 

simulations,  a  brief  description  is  given  on  the  findings  for  each  of  the  six  methodologies  presented 
in  this  thesis. 

5.1 .1  Pre-Posturing  Sampling  Plan. 

The  purpose  of  the  Pre-Posturing  sampling  method  was  to  distribute  the  suits  before  sampling 
began  in  such  a  way  that  fewer  suits  would  be  sampled  in  the  earlier  years  when  less  samples  were 
thought  to  be  needed  and  more  suits  would  be  sampled  in  the  later  years  when  more  suits  would  be 

required  to  be  tested.  In  doing  so,  it  was  hoped  that  the  Type  II  error  of  accepting  a  bad  suit  would 
be  reduced. 

The  simulations  reinforced  the  belief  that  the  success  of  this  methodology  was  largely  a  factor 
of  being  able  to  accurately  predict  when  the  suits’  characteristics  would  degrade  below  the  set 
minimum  requirement.  If  the  degradation  of  the  suits  occurred  at  one  of  the  years  in  which  few 
suits  were  postured,  the  methodology  was  shown  to  be  ineffective  at  rejecting  a  bad  suit.  This 
methodology  rehed  too  much  on  prior  information  in  knowing  when  the  suits  would  degrade  in 
their  reliability  in  order  to  be  a  useful  methodology. 

5.1 .2  Aggregated  Sequential  Sampling. 

The  puipose  of  this  sampling  plan  was  to  assign  weights  to  each  of  the  tests  according  to  the 
deasion  maker’s  expert  opinion  on  which  tests  were  more  important  in  determining  the  reliability 


81 


of  the  suits.  This  was  the  only  methodology  which  took  into  account  the  decision  maker’s  opinion 
on  how  the  tests  would  relate  to  one  another. 

The  simulations  showed  when  weights  were  added  to  the  tests  and  an  aggregated  sum  was 
achieved,  there  was  a  substantial  reduction  in  the  number  of  samples  needed  for  the  lifetime  of  the 
tests.  The  average  number  of  samples  needed  for  this  test  was  under  60  samples  per  year,  as 
compared  to  the  original  sampling  plan  which  averaged  360  per  year  and  the  Truncated  Sequential 
plan  that  averaged  around  323  suits  per  year  over  the  same  degradation  functions.  The  one 
uncertainty  of  the  test  was  the  assumption  of  independence  between  the  tests.  If  independence  is 
assumed  but  positive  covariance  exists  between  the  tests,  then  the  variance  of  the  aggregated  sum 
is  underestimated.  This  will  result  in  too  few  samples  being  drawn  to  meet  the  given  a  and 
p  requirements.  The  actual  Type  I  and  II  errors  would  be  larger  than  believed.  The  magnitude  of 
these  errors  relies  on  the  size  of  the  covariance  between  the  tests  and  further  study  is  needed  to 
determine  the  effects  of  covariance. 

5.1.3  Attribute  Bayesian  Sampling  Plan. 

This  plan  converted  the  variable  test  results  to  attribute  pass/fail  results  and  used  Bayesian 
prior  knowledge  to  detect  the  degradation  of  the  suits.  The  simulations  showed  that  converting  the 
variable  restilts  to  attribute  data  resulted  in  too  much  loss  of  information  and  this  particular 
methodology  utilizing  Bayesian  statistics  was  not  suitable  for  detecting  the  degradation  of  the 
chemical  suits.  The  Bayesian  concept  does  have  hope  however,  if  a  similar  plan  that  utilizes 
variable  data  can  be  incorporated  with  the  Bayesian  concept  in  detecting  the  degradation  of  the 
chemical  suits’  quality. 

5.1.4  Standard  Sequential  Sampling  Plan. 

This  sampling  plan  used  unbounded  sequential  sampling  to  determine  the  degradation  in 
quality  of  the  chemical  suits.  Due  to  some  of  the  tests  requiring  large  samples  to  make  an  accept 
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or  reject  decision,  this  test  proved  to  be  highly  inefficient.  The  average  sample  size  for  the  lifetime 
of  the  suit  was  867  samples  using  this  methodology.  This  was  by  far  the  largest  average  out  of  all 
the  other  sampling  methodologies.  It  was  also  shown  that  the  increase  in  effectiveness  by  using 
such  a  large  sample  was  minimal.  The  Truncated  Sequential  used  on  average  633  fewer  suits,  yet 
its  ability  to  reject  suits  in  the  correct  year  only  decreased  by  2.1  percentage  points.  The  standard 
sequential  test  is  much  too  inefficient  to  be  of  useful  value  for  sampling  the  chemical  defense 
coveralls. 

5.1.5  Original  Sampling  Plan. 

The  original  sampling  plan  sampled  a  fixed  amount  of  suits  every  year  and  ensured  that  the 
probability  of  a  Type  I  error  would  be  .05.  It  was  unable  to  have  any  effect  in  controlling  the  Type 
II  error  however.  While  this  sampling  methodology  compared  well  to  the  other  methodologies  in 
the  Condition  U  simulations,  it  was  largely  an  effect  due  to  the  distances  between  Uo  and  Ui  for 
the  various  tests.  When  the  distance  between  Uo  and  Ui  is  large,  the  original  sampling  plan  does 
well,  but  when  these  differences  are  small,  the  sampling  plan  does  not  fare  so  well.  The  plan  is  too 
inflexible  to  the  distance  between  Uo  and  Ui,  which  is  a  variable  that  cannot  be  controlled  by  this 
sampling  plan.  The  result  is  the  possibility  of  a  large  Type  II  error,  which  results  in  degraded  suits 
being  labeled  as  acceptable. 

5.1.6  Truncated  Sequential  Sampling  Plan. 

The  Truncated  Sequential  plan  uses  the  concepts  of  the  standard  sequential  plan,  but  puts  a 
limit  on  the  upper  bound  of  suits  that  can  be  sampled.  This  plan  starts  out  with  360  suits  available 
for  sampling,  just  as  the  original  sampling  plan  does.  If  few  suits  are  sampled  in  the  earlier  years 
of  testing,  it  is  able  to  save  these  suits  for  testing  in  the  later  years  if  necessary. 

This  sampling  plan  proved  to  be  the  most  flexible  and  efficient  sampling  plan.  This  plan 
performed  as  well  or  better  than  the  other  methodologies  in  the  Condition  I  and  III  simulations.  In 
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the  Condition  II  simulations,  this  sampling  plan  did  not  do  as  well  as  the  original  sampling  plan 
did.  Since  the  distance  was  so  great  between  Uo  and  Ui  in  the  Condition  n  simulations,  the  original 
sampling  plan  had  very  small  Beta  values  (Table  16).  The  original  sampling  plan  performed  better 
in  the  Condition  II  simulations,  but  it  over-sampled,  giving  the  test  manager  smaller  Beta  values 
than  were  requested.  The  Truncated  Sequential  sampling  plan  however,  is  flexible  to  the  various 
distances  that  may  occur  between  Uo  and  Ui ,  and  is  able  to  control  the  Type  II  error. 

5.2  Recommendations 

I  feci  that  the  Trancated  Sequential  sampling  plan  was  shown  to  be  the  best  sampling  plan  to 
monitor  the  shelf-life  of  the  chemical  defense  coveralls.  This  plan  is  flexible  to  both  the  a  and  P 
requirements  and  flexible  to  the  distances  between  Uo  and  Ui  that  are  set  by  the  decision  makers. 
This  plan  was  also  able  to  reduce  the  number  of  samples  used  to  test  the  suits  when  compared  to 
the  original  sampling  plan. 

The  sampling  plan  is  relatively  easy  to  implement.  It  is  required  that  a  standard  deviation  is 
known  or  estimated  for  the  various  tests  that  will  be  used  in  sampling.  These  standard  deviations 
are  available  through  the  research  and  development  and  pre-testing  of  the  chemical  defense  suits. 
Additional  data  to  help  determine  the  standard  deviations  will  also  be  available  once  the  baseline 
tests  are  run  on  the  coveralls. 

Sequential  Sampling  will  require  the  ones  who  perform  the  actual  tests  to  continually  update 
the  sequential  test  values  after  each  test  is  performed  on  a  suit.  This  way,  when  all  the  tests  make 
an  accept  or  reject  decision,  the  sampling  can  be  stopped  immediately  to  prevent  any  unnecessary 
sampling. 

I  also  believe  that  the  Aggregated  Sequential  sampling  plan  has  potential.  The  number  of 
samples  used  was  greatly  reduced  by  aggregating  the  test  results.  This  plan  requires  that  the 
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decision  makers  determine  how  much  influence  they  would  like  each  test  to  have  in  determining  the 
accept  /  reject  decision  of  the  suits.  Weights  are  assigned  to  each  test  in  relative  importance  and 
the  weights  should  sum  to  one. 

This  plan  needs  analysis  in  determining  the  effects  of  covariance  between  the  tests  on  the 
sequential  sampling.  Covariance  can  possibly  be  estimated  from  the  research  and  development  and 
pre-testing.  Another  alternative  is  to  evaluate  what  the  effects  would  be  if  the  tests  were  assumed 
to  be  independent  but  covariance  actually  existed.  Perhaps  the  Aggregated  Sequential  plan  could 
then  be  modified  to  account  for  the  unknown  covariance. 

If  neither  of  these  plans  is  implemented,  I  feel  it  is  extremely  important  that  some 
consideration  of  the  probability  of  a  Type  n  error  is  taken  into  consideration.  Recall  that  when  a 
Type  II  error  does  occur,  it  is  labeling  degraded  suits  as  reliable.  The  danger  then  exists  that 
commanders  in  the  field  believe  they  have  adequate  protection  against  chemical  warfare  when  in 
fact,  the  protection  may  be  severely  limited  due  to  having  degraded  suits.  Rather  than  setting  the 
original  sampling  plan  such  that  a  =  .05,  maybe  the  sampling  plan  could  be  designed  such  that 
p  =  .05.  Whereas  a  small  a  error  will  prevent  suits  from  being  rejected  while  they  are  stUl  good,  a 
small  p  error  will  prevent  suits  from  being  accepted  when  they  are  no  longer  good. 

5 .3  Recommendations  for  Further  Research 

There  is  still  more  work  and  studies  needed  to  fuUy  understand  this  topic  and  we  now  give  our 
recommendations  for  further  research. 

1 .  A  development  of  Launer  and  Singpurwalla’s  Bayesian  plan  to  use  variable  data  as 
opposed  to  their  attribute  data.  Information  is  lost  when  converting  variable  data  to  attribute  data. 
A  sampling  plan  incorporating  Bayesian  statistical  methods  with  variable  data  may  prove  to  be 
quite  effective. 
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2.  A  study  using  cost  analysis  or  decision  analysis  to  detennine  the  optimal  settings  of  a  and 
P .  As  the  values  for  a  and  p  decrease,  more  sampling  is  needed,  but  the  Type  I  and  Type  II 
errors  decrease.  A  study  to  find  the  optimal  values  for  a  and  P  would  be  valuable.  Along  with 
this  study  could  be  a  determination  of  the  optimal  number  of  samples  that  should  be  allowed  in  the 
Truncated  Sequential  sampling  plan  before  the  sampling  is  truncated. 

3.  A  study  to  test  the  sensitivity  of  assuming  a  normal  distribution  when  the  actual 
distribution  is  of  some  other  type.  We  assumed  the  samples  were  pulled  from  a  nonnal 
distribution.  It  would  be  interesting  to  see  how  the  results  are  affected  if  the  distribution  deviated 
slightly  or  greatly  from  a  normal  distribution. 

4.  A  methodology  to  test  the  suits  and  detennine  if  there  is  a  difference  in  the  degradation 
between  the  various  lots.  A  sampling  plan  could  be  devised  that  would  be  able  to  detect  a 
difference  in  degradation  between  lots.  If  differences  were  detected,  the  plan  would  be  able  to 
reject  the  bad  lots  while  keeping  the  good  lots. 

5.  A  study  on  the  effects  of  covariance  in  the  Aggregated  Sequential  sampling  plan.  This 
plan  showed  a  substantial  reduction  in  the  number  of  samples  to  be  used  when  the  tests  were 
aggregated.  Uncertainty  exists  however,  on  how  the  presence  of  covariance  would  effect  the 
results. 

6.  A  study  that  would  take  into  account  the  possibility  of  the  sampling  tests  being  less  than 
100%  accurate.  This  would  add  complexity  to  the  sampling  model.  The  study  could  also 
determine  what  the  effects  are  when  assuming  100%  accuracy,  when  the  tests  are  actually  less  than 
100%  accurate. 

7.  A  study  that  could  take  into  accoimt  the  effect  that  various  environmental  conditions  have 
on  the  reliability  of  the  suits.  In  this  thesis,  the  suits  that  are  to  be  sampled  are  taken  from  an 
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environmentally  controlled  warehouse.  It  would  be  interesting  to  see  how  the  effects  of  storing  the 
suits  in  less  than  perfect  conditions  affects  the  reliability. 
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Appendix  A .  Computer  Programs 


This  appendix  lists  the  computer  programs  that  were  used  to  simulate  the  sampling 
methodologies.  The  simulations  were  nm  in  SLAM  II  with  the  exceptions  of  the  Bayesian  and 
Truncated  Sequential  plans  which  were  mn  using  FORTRAN.  The  SLAM  II  programs  are 
presented  in  their  text  form  only. 


A.l  Original  Sampling  Plan 

CREATE,.01,„30000,1; 

ACTIVITY„II.LT.l; 

ACTIVITY, „ZAAH; 

ASSIGN,II=1; 

ACTIVITY; 

ZAAH  ASSIGN, ATRIB(l)=XX(II)*246.0,ATRIB(5)=XX(II)*10.5,ATRIB(3)=XXai)*159.17, 
ATRIB(7)=XX(II)*7.5,ATRIB(9)=XX(II)*100.0: 

ACTIVITY; 

ASSIGN,ATRIB(2)=RNORM(ATRIB(l),27.22),ATRIB(6)=RNORM(ATRIB(5),1.342), 
ATRIB(4)=RNORM(ATRIB(3),9.73),ATRIB(8)=RNORM(ATRIB(7),1.342),ATRIB(10)= 
RNORM(ATRIB(9),  10.0); 

ACTIVITY; 

ACCUMULATE,30,30,SUM; 

ACTIVITY; 

ASSIGN,ATRIB(4)=ATRIB(4)/30.0,ATRIB(6)=ATRIB(6)/30.0,ATRIB(8)=ATRIB(8)/ 

30.0,ATRIB(10)=ATRIB(10)/30.0,ATRIB(2)=ATRIB(2)/30.0,6; 

ACTIVITY„ATRIB(2).GE.237.8; 

ACTIVITY,,  ATRIB(4).GE.  156.2,ZAAC; 

ACTIVITY,,  ATRIB(6).GE.  10. 1  ,ZAAD; 

ACTIVITY„ATRIB(8).GE.7.097,ZAAE; 

ACTIVITY„ATRIB(10).GE.97.0,ZAAF; 

ACTIVITY,„ZAAG; 

ASSIGN,ATRIB(1 1)=ATRIB(1 1)+1; 

ACTIVITY; 

COLCT(2),ATRIB(l  1),BREAK  WARP; 

ACTIVITY; 

ZAAB  TERMINATE; 

ZAAC  ASSIGN,ATRIB(12)=ATRIB(12)+1; 

ACTIVITY; 

COLCT(4),ATRIB(12),BREAK  HLL; 

ACTIVITY,„ZAAB; 

ZAAD  ASSIGN,ATRIB(13)=ATRIB(13)+1; 

ACTIVITY; 
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COLCT(6),ATRIB(13),TEAR  WARP; 
ACTIVITY„,ZAAB; 

ZAAE  ASSIGN, ATRIB(14)=ATRIB(14)+1; 
ACTIVITY; 

COLCT(8),ATRIB(14),TEAR  FILL; 
ACTIVITY, „ZAAB; 

ZAAF  ASSIGN, ATRIB(15)=ATRIB(15)+1; 
ACTIVITY; 

COLCT(10),ATRIB(I5),SEAM; 

ACTIVITY,„ZAAB; 

ZAAG  ACCUMULATE, 1000, 1000; 
ACTIVITY; 

ASSIGN,II=II+1; 

ACTIVITY,„ZAAB; 

END; 


A. 2  Pre-Posture  Sampling  Plan 

CREATE,.!  ,,,50000,1; 

ACTIVITY„II.LT.l; 

ACTIVITY,„ZAAH; 

ASSIGN,II=1; 

ACTIVITY; 

ZAAH  ASSIGN,ATRIB(l)=XX(n)*246.0,ATRIB(5)=XX(II)*10.5,ATRIB(3)=XX(II)*159.17, 
ATRIB(7)=XX(II)*7.5,  ATRIB(9)=XX(II)*  100.0; 

ACTIVITY; 

ASSIGN,  ATRIB(2)=RNORM(ATRIB(l ) , 27.22),  ATRIB(6)=RNORM(ATRIB(5), 1 .342), 
ATRIB(4)=RNORM(ATRIB(3),9.73),  ATRIB(8)=RNORM(ATRIB(7), 1 .342),ATRIB(  10)= 
RNORM(ATRIB(9),10.0),ATRIB(16)=ARRAY(6,II),XX(9)=ARRAY(6,II); 

ACTIVITY; 

ACCUMULATE,ATRIB(16),ATRIB(16),SUM; 

ACTIVITY; 

ASSIGN,ATRIB(4)=ATRIB(4)/XX(9),ATRIB(6)=ATRIB(6)/XX(9),ATRIB(8)=ATRIB(8)/ 

XX(9),ATRIB(10)=ATRIB(10)/XX(9),ATRIB{2)=ATRIB(2)/XX(9),6; 

ACTIVITY„ATRIB(2).GE.ARRAY(1,II),;BRKSTRW; 

ACTIVITY„ATRIB(4).GE.ARRAY(2,II),ZAAC;BRKSTRF; 

ACTIVITY„ATRIB(6).GE.ARRAY(3,n),ZAAD;TRSTRW; 

ACTIVITY„ATRIB(8).GE.ARRAY(4,II),ZAAE;TRSTRF; 

ACTIVITY„ATRIB(10).GE.ARRAY(5,II),ZAAF;SEAMSTR; 

ACTIVITY,„ZAAG; 

ASSIGN,ATRIB(1 1)=ATRIB(1 1)+1; 

ACTIVITY; 

COLCT(2),ATRIB(ll),BREAK  WARP; 

ACTIVITY; 

ZAAB  TERMINATE; 

ZAAC  ASSIGN,ATRIB(12)=ATRIB(12)+1; 

ACTIVITY; 
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COLCT(4),ATRIB(12), BREAK  FILL; 
ACTIVITY,„ZAAB; 

ZAAD  ASSIGN, ATRIB(13)=ATRIB(13)+1; 
ACTIVITY; 

COLCT(6),ATRIB(13),TEAR  WARP; 
ACTIVITY, „ZAAB; 

ZAAE  ASSIGN, ATRIB(14)=ATRIB(14)+1; 
ACTIVITY; 

COLCT(8),ATRIB(I4),TEAR  FILL; 
ACTIVITY, „ZAAB; 

ZAAF  ASSIGN,ATRIB(I5)=ATRIB(I5)+I; 
ACTIVITY; 

COLCT(  I0),ATRIB(I5),SEAM; 
ACTIVITY,„ZAAB; 

ZAAG  ACCUMULATE,  1000, 1000; 
ACTIVITY; 

ASSIGN,II=II+I; 

ACTIVITY; 

TERMINATE,!; 

END; 


A3  Sequential  Sampling  Plan 

CREATE,.0I„„I; 

AC'nVITY„II.LT.I; 

ACTIVITY,„ZAAE; 

ASSIGN,II=I; 

ACTIVITY; 

ZAAE  ASSIGN,ATRIB(I)=XX(II)*246; 

ACTIVITY; 

ASSIGN,ATRIB(2)=RNORM(ATRIB(I),27.22),XX(I3)=XX(I3)+ATRIB(2),XX(I4)=XX( 

I4)+I,XX(I5)=29.79+XX(I4)*2I8,XX(I6)=-38.24+XX(I4)*2I8,I; 

ACTIVITY„XX(1 3).GE.XX(  15); 

ACTIVITY„XX(I3).LE.XX(I6),ZAAC; 

ACTIVITY,„ZAAD; 

COLCT,XX(I4),ACCEPTED; 

ACTIVITY; 

ZAAB  COLCT,XX(I4),AVERAGE; 

ACTIVITY; 

ASSIGN,XX(I3)=0,XX(I4)=0,XX(I5)=0,XX(I6)=0; 

ACTIVITY; 

ACCUMULATE,IOOO,IOOO; 

ACTIVITY; 

ASSIGN,II=II+I; 

ACTIVITY; 

TERMINATE,!; 

ZAAC  COLCT,XX(I4),REJECTED; 
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ACnVITY,„ZAAB; 
ZAAD  TERMINATE; 
END; 


A.4  Aggregated  Sequential  Sampling  Plan 

CREATE,.01,„,1; 

ACTIVITY„II.LT.l; 

ACTIVITY,„ZAAE; 

ASSIGN, 11=1; 

ACTIVITY; 

ZAAE  ASSIGN, ATRIB(l)=XX(n)*246,ATRIB(3)=XX(II)*  159. 17,ATRIB(5)=XXai)*  10.5, 
ATRIB(7)=XX(lI)*7.5,ATRIB(9)=XXai)*  100,ATRIB(1  l)=XXai)*98. 17; 

ACTIVITY; 

ASSIGN,ATRIB(13)=XX(II)*98.17,ATRIB(15)=XX(II)*13.7,ATRIB(17)=XX(II)*13.7, 

ATRIB(19)=XX(II)*2.32,ATRIB(21)=XX(II)*2.32; 

ACTIVITY; 

ASSIGN, ATRIB(2)=RNORM(ATRIB(1),27.22),ATRIB(2)=ATRIB(2)-190,ATRIB(2)= 
ATRIB(2)/27.22*.0556,ATRIB(4)=RNORM(ATRIB(3),9.73),ATRIB(4)=ATRIB(4)-115, 

1; 

ACTIVITY; 

ASSIGN,ATRIB(4)=ATRIB(4)/9.73*.0556,ATRIB(6)=RNORM(ATRIB(5),1.342),ATRIB( 

6)=ATRIB(6)-10,ATRIB(6)=ATRIB(6)/1.342*.0556,1; 

ACTIVITY; 

ASSIGN,ATRIB(8)=RNORM(ATRIB(7),1.342),ATRIB(8)=ATRIB(8)- 

7,ATRIB(8)=ATRIB( 

8)/1.342*.0556,ATRIB(10)=RNORM(ATRIB(9),10),ATRIB(10)=ATRIB(10)-70,l; 

ACTIVITY; 

ASSIGN,ATRIB(10)=ATRIB(10)/10*.0556,ATRIB(12)=RNORM(ATRIB(11),8.68),ATRIB( 

12)=ATRIB(12)-90,ATRIB(12)=ATRIB(12)/8.68*.0556,1; 

ACTIVITY; 

ASSIGN,ATRIB(14)=RNORM(ATRIB(13),8.68),ATRIB(14)=ATRIB(14)-90,ATRIB(14)= 

ATRIB(14)/8.68*.0556,ATRIB(16)=RNORM(ATRIB(15),9.55),ATRIB(16)=20-ATRIB( 

16),1; 

ACTIVITY; 

ASSIGN,ATRIB(16)=ATRIB(16)/9.55*.0556,ATRIB(18)=RNORM(ATRIB(17),9.55), 
ATRIB(18)=20-ATRIB(1 8),ATRIB(18)=ATRIB(1 8)/9.55*.0556,l ; 

ACTIVITY; 

ASSIGN,ATRIB(20)=RNORM(ATRIB(19),.29),ATRIB(20)=ATRIB(20)-1.3,ATRIB(20)= 
ATRIB(20)/.29*.25,ATRIB(22)=RNORM(ATRIB(21),.29),ATRIB(22)=ATRIB(22)-1.3, 
ATRIB(22)=ATRIB(22)/.29*.25, 1 ; 

ACTIVITY; 

ASSIGN,XX(17)=ATRIB(2)+ATRIB(4)+ATRIB(6)+ATRIB(8)+ATRIB(10)+ATRIB(12),XX( 

17)=XX(17)+XX(14)+XX(16)+XX(18)+XX(20)+XX(22),1; 

ACTIVITY; 
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ASSIGN, XX(13)=XX(13)+XX(17),XX(14)=XX(14)+1,XX(15)=1.19+XX(14)*-.1441,XX( 
16)=-1.53+XX(14)*-.1441,1; 

ACTIVITY„XX(13).GE.XX(15); 

ACTIVITY„XX(1 3).LE.XX(1 6),ZAAC; 

ACTIVITY,„ZAAD; 

C0LCT,XX(14),ACCEPTED; 

ACTIVITY; 

ZAAB  C0LCT,XX(I4), AVERAGE; 

ACTIVITY; 

ASSIGN, XX(13)=0,XX(14)=0,XX(15)=0,XX(16)=0; 

ACTIVITY; 

ACCUMULATE,1000,1000; 

ACTIVITY; 

ASSIGN,II=II+1; 

ACTIVITY; 

TERMINATE,!; 

ZAAC  C0LCT,XX(14),REJECTED; 

ACTIVITY,„ZAAB; 

ZAAD  TERMINATE; 

END; 


A.5  Truncated  Sequential  Sampling  Plan 

IMPLICIT  DOUBLE  PRECISION(A-H,0-Z) 

INTEGER  BSWFLAG,  BSFFLAG,  TSWFLAG,TSFFLAG,SEAMFLAG 
INTEGER  SPRYFLAG,  CHEMFLAG,  YR,  FLAGS,  I,J,  YR,  TESTS(I2) 

INTEGER  BSWS(12),BSFS(12),TSWS(12),TSFS(12),SEAMS(12) 

INTEGER  SPRYS(12),  H20S(I2),  CHEMS(12),MASTER 
REAL  SUITRUN,  DEGEN1(I2),TESTR(12) 

REALDEGEN2(12),DEGEN3(12),DEGEN4(12),DEGEN5(12),DEGEN6(12) 

REALRNN0F,BSW,BSF,TSW,TSF,SEAM,SPRY,H20,CHEM 

REAL 

BSWSUM,BSFSUM,TSWSUM,TSFSUM,SEAMSUM,SPRYSUM,H20SUM,CHEMSUM 
EXTERNAL  RNNOF,RNSET 


DATA  DEGENl/  I.,l.,l.,l.,l.,.998,.993,.976,.923,.784, 

+  .505,.  165/ 

DATA  DEGEN2/  .999,.998,.993,.983,.961,.921,.85,.74,.6, 

+  .42,.25,.ll/ 

DATA  DEGEN3/  .883,3859,.836,.812,.789,.766,.744,.72,.7, 
+  .68,.66,.64/ 

DATA  DEGEN4/  .78,.75I,.723,.7,.672,.65,.628,.607,.587, 

+  .568,.55,.53/ 

DATA  DEGEN5/  .68,.654,.632,.613,.595,.58,.563,.55,.536, 
+  .523,.51,.5/ 
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DATA  DEGEN6/  .969,.952,.93,.904,.873,.838,.8,.757,.711, 
+  .66, .61, .56/ 

CALL  RNSET(6125) 

DO  20 1=1,12 
TESTS(I)=0 
BSWS(I)=0 
BSFS(I)=0 
TSWS(I)=0 
TSFS(I)=0 
SEAMS(I)=0 
SPRYS(I)=0 
H2OS(I)=0 
CHEMS(I)=0 
20  CONTINUE 

PRINT  *,’TRUNC  2  RUN  1' 

DO  100  MASTER=1,1000 

SUITS=360.0 

T=0.0 

SUITRUN=30.0 
DO  30  YR=1,12 

BSWFLAG=0 

BSFFLAG=0 

TSWFLAG=0 

TSFFLAG=0 

SEAMFLAG=0 

SPRYFLAG=0 

CHEMFLAG=0 

BSWSUM=0.0 

BSFSUM=0.0 

TSWSUM=0.0 

TSFSUM=0.0 

SEAMSUM=0.0 

SPRYSUM=0.0 

H2OSUM=0.0 

CHEMSUM=0.0 


IF  ((FLAGS.NE.7).AND.(T.LT.SUITRUN))  THEN 
40  T=T+1.0 

TESTS(YR)=TESTS(YR)+ 1 
BSW=246.0*DEGEN2(YR)+RNNOF0*27.22 
BSF=  159.17*DEGEN2(YR)+RNNOF()*9.73 
TSW=  10.5*DEGEN2(YR)+RNNOF()*  1 .342 
TSF=  7.5*DEGEN2(YR)+RNNOF()*  1.342 
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ta  ^  ^  Ti  ^  ^ 


SEAM=  100.0*DEGEN2(YR)+RNNOF()*10.0 
SPRY=  98.17*DEGEN2(YR)+RNNOF()*8.68 
H20=  13.7*(2.0-DEGEN2(YR))+RNNOF()*9.55 
CHEM=  2.32*DEGEN2(YR)+RNNOF()*.29 

BSWSUM=BSWSUM+BSW 
BSFSUM=BSFSUM+BSF 
TSWSUM=TSWSUM+TSW 
TSFSUM=TSFSUM+TSF 
SEAMSUM=SEAMSUM+SEAM 
SPRYSUM=SPRYSUM+SPRY 
H20SUM=H20SUM+H20 
CHEMSUM=CHEMSUM+CHEM 

BS  WHI=29.79+21 8 .0*T 
BSWLO=-38.24+218.0*T 
BSFHI=4.83+137.085*T 
BSFLO=-6.2+137.085*T 
TSWHI=8.11+10.25*T 
TSWLO=-10.41+10.25*T 
TSFHI=8.11+7.25*T 
TSFLO=-10.41+7.25*T 
SEAMHI=7.5+85.0*T 
SEAMLO=-9.63+85.0*T 
SPRYHI=20.76+94.085*T 
SPRYLO=-26.65+94.085 *T 
CHEMHI=.19+1.81*T 
CHEMLO=-.24+l  .8 1  *T 

IF  ((BSWSUM.GE.BSWHI).OR.(BSWSUM.LE.BSWLO))  BSWFLAG=1 
((BSFSUM.GE.BSFHI).OR.(BSFSUM.LE.BSFLO))  BSFFLAG=1 
((TSWSUM.GE.TSWHI).OR.(TSWSUM.LE.TSWLO))TSWFLAG=l 
((TSFSUM.GE.TSFHI).0R.(TSFSUM.LE.TSFL0))TSFFLAG=1 
((SEAMSUM.GE.SEAMHI).OR.(SEAMSUM.LE.SEAMLO))SEAMFLAG=l 
((SPRYSUM.GE.SPRYHI).0R.(SPRYSUM.LE.SPRYL0))SPRYFLAG=1 
((CHEMSUM.GE.CHEMHI).0R.(CHEMSUM.LE.CHEML0))CHEMFLAG=1 

FLAGS=BS\VFLAG+BSFFLAG+TSWFLAG+TSFFLAG+SEAMFLAG+ 

C  SPRYFLAG+CHEMFLAG 

IF  ((FLAGS.NE.7).AND.(T.LT.SUITRUN))  GO  TO  40 
END  IF 


BSWCRT=((29.79-38.24)/2.0)+2I8.0*T 
BSFCRT=((4.83-6.2)/2.0)+I37.085*T 
TSWCRT=((8.II-10.41)/2.0)+I0.25*T 
TSFCRT=((8. 1 1-  10.4I)/2.0)+7.25*T 
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SEAMCRT=((7.5-9.63)/2.0)+85.0*T 
SPRYCRT=((20.76-26.65)/2.0)+94.085*T 
H20CRT=((-32.59+41.84)/2.0)+16.85*T 
CHEMCRT=((.  19-.24)/2.0)+l  .8 1  *T 

IF  (BSWSUM.GE.BSWCRT)  BSWS(YR)=BSWS(YR)+1 
IF  (BSFSUM.GE.BSFCRT)  BSFS(YR)=BSFS(YR)+1 
IF  (TSWSUM.GE.TSWCRT)  TSWS(YR)=TSWS(YR)+1 
IF  (TSFSUM.GE.TSFCRT)  TSFS(YR)=TSFS(YR)+1 
IF  (SEAMSUM.GE.SEAMCRT)  SEAMS(YR)=SEAMS(YR)+1 
IF  (SPRYSUM.GE.SPRYCRT)  SPRYS(YR)=SPRYS(YR)+1 
IF  (H20SUM.LE.H20CRT)  H20S(YR)=H20S(YR)+1 
IF  (CIIEMSUM.GE.CHEMCRT)  CHEMS(YR)=CHEMS(YR)+1 

SUITS=SUITS-T 

YRREAL=YR 

IF  (YR.NE.12)  SUITRUN=SUITS/(12.0-YRREAL) 


T=0.0 

BSWFLAG=0 

BSFFLAG=0 

TSWFLAG=0 

TSFFLAG=0 

SEAMFLAG=0 

SPRYFLAG=0 

CHEMFLAG=0 

FLAGS=0 

30  CONTINUE 
100  CONTINUE 

DO  501=1,12 

TESTR(J)=TESTS(J)/1000.0 
PRINT  ’ 

PRINT  *,J 

PRINT  *,  'TESTS  ’,TESTR(J) 
PRINT  *,  'BSW  ',BSWS(J) 
PRINT  *,  'BSF  ',BSFS(J) 
PRINT  *,  'TSW  ',TSWS(J) 
PRINT  *,  'TSF  ',TSFS(J) 
PRINT  *,  'SEAM  ',SEAMS(J) 
PRINT  'SPRAY  ',SPRYS(J) 
PRINT  *,  'WATER  ',H20S(J) 
PRINT  *,  'CHEM  ',CHEMS(J) 
50  CONTINUE 
END 
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A. 6  Bayesian  Sampling  Plan 


IMPLICIT  DOUBLE  PRECISION(A-H,0-Z) 

REALPASSES(12),REJECTS(I2),TESTSIZE(12),TESTAVG(12),DEGEN1(12) 
REALDEGEN2(12),DEGEN3(12),DEGEN4(12),DEGEN5(12),DEGEN6(12) 
REAL  RNNOF 
EXTERNAL  RNNOF, RNSET 


DATA  DEGENl/  1.,L,1.,1.,1.,.998,.993,.976,.923,.784, 

+  .505,.  165/ 

DATA  DEGEN2/  .999,.998,.993,.983,.961,.921,.85,.74,.6, 

+  .42,.25,.ll/ 

DATA  DEGEN3/  .883,3859,.836,.812,.789,.766,.744,.72,.7, 
+  .68,.66,.64/ 

DATA  DEGEN4/  .78,.751,.723,.7,.672,.65,.628,.607,.587, 

+  .568,.55,.53/ 

DATA  DEGEN5/  .68,.654,.632,.613,.595,.58,.563,.55,.536, 
+  .523, .51, .5/ 

DATA  DEGEN6/  .969,.952,.93,.904,.873,.838,.8,.757,.711, 
+  .66,.61,.56/ 

D0469MASTER=1,6 

DO  440KNT4=1,12 
PASSES(KNT4)=0.0 
REJECTS(KNT4)=0.0 
TESTSIZE(KNT4)=0.0 
440  CONTINUE 


PRINT  *,’Chem',MASTER 

FIRSTMN=2.32 

STDDEV=.29 

TSTMIN=1.3 

CALL  RNSET(1966) 


*  LOOP  RUNS  1000  SIMULATIONS 

D0410KNT1=1,100 

*  PRINT  *,'KNT1',KNT1 

*  INITIALIZE  STARTING  VARIABLES 


11=1 

SIG=9.9 

SDELT=.l 

DELT=.49 
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400  IF(II.LE.12)THEN 

CALL  BAYES(SIG,SDELT,DELT,XTM,NTM) 

GOOD=0.0 

BAD-0.0 


DO  420  KNT2=1,NTM 
IF  (MASTER.EQ.1)  THEN 

SUITVAL=FIRSTMN*DEGENl(II)+RNNOF()*STDDEV 
END  IF 

IF  (MASTER.EQ.2)  THEN 

SUITVAL=FIRSTMN*DEGEN2(II)+RNNOF()*STDDEV 
END  IF 

IF  (MASTER.EQ.3)  THEN 

SUITVAL=FIRSTMN*DEGEN3(II)+RNNOF()*STDDEV 
END  IF 

IF  (MASTER.EQ.4)  THEN 

SUITVAL=FIRSTMN*DEGEN4(II)+RNNOF()*STDDEV 
END  IF 

IF  (MASTER.EQ.5)  THEN 

SUITVAL=FIRSTMN*DEGEN5ai)+RNNOF()*STDDEV 
END  IF 

IF  (MASTER.EQ.6)  THEN 

SUITVAL=FIRSTMN*DEGEN6(II)+RNNOF()*STDDEV 
END  IF 


IF  (SUITVAL.GE.TSTMIN)  THEN 
GOOD=GOOD+LODO 
ELSE 

BAD=BAD+1.0D0 
END  IF 

420  CONTINUE 

IF  (GOOD  .GE.  XTM)  THEN 
PASSES(II)=PASSES(II)+1 .0 
SIG=SIG+GOOD 
SDELT=SDELT+BAD 
DELT=(SIG/(SIG+SDELT))-.5D0 
IF  (DELT  .LE.  0.0)  THEN 
PASSES(II)=PASSES(II)- 1 .0 
GO  TO  415 
END  IF 
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TESTSIZE(II)=TESTSIZE(II)+NTM 
11=11+1 
GO  TO  400 
ELSE 

415  REJECTS(II)=REJECTS(II)+1.0 

TESTSIZE(II)=TESTSIZE(II)+NTM 
END  IF 

ELSE 
11=1 
END  IF 

410  CONTINUE 

*  OUTPUT  THE  RESULTS  OF  SIMULATION 
DO  430  KNT3=1,12 

IF  ((PASSES(KNT3)+REJECTS(KNT3)).LE.0.0)  THEN 
TESTA  VG(KNT3)  =  0.0 
ELSE 

TESTAVG(KNT3)=TESTSIZE(KNT3)/(PASSES(KNT3)+REJECTS(KNT3)) 
END  IF 

PRINT  *,'YEAR’,KNT3 

PRINT  *,'TESTSIZE:  ',TESTAVG(KNT3) 

PRINT  ^.’PASSED:  ',PASSES(KNT3) 

PRINT  *,'REJECTED:  ’,REJECTS(KNT3) 

PRINT  ' 

430  CONTINUE 
469  CONTINUE 
END 


SUBROUTINE  BAYES(SGM,SDEL,DEL,XT,NT) 
IMPLICIT  DOUBLE  PRECISION(A-H,0-Z) 
DOUBLE  PRECISION  MDBETA 

13  NT=2 

ALF=.05 

BETA1=.10 

BET=1.0-BETA1 

X1=DEL 

X2=1.0 


*  WE  START  THE  ALGORITHM  BY  INITIATING  XT  AS  ZERO 
W1=SGM 
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W2=SDEL 

A1=W1 

B1=W2 

CALL  FACT1(A1,B1,S0N) 

W=SON 
11  XT=0.0 

WNT=NT 
W4=WNT+SDEL 
TA1=SGM 
TB1=W4 

CALL  FACT2(TA1  ,TB  1  ,TERS) 

PAR=TERS 

C01=W*PAR 

*  THIS  IS  THE  VALUE  WHEN  XT  IS  ZERO 

*  NOW  WE  COMPUTE  THE  VALUE  G1  WHEN  XT  IS  OTHER  THAN  ZERO 
301  IXT=XT 

T0T=C01 

IF  (XT.EQ.0.0)  GO  TO  1001 

DO  1000 1=1, IXT 
RI=I 

P1=W1+RI 

P2=W4-RI 

TA1=P1 

TB1=P2 

CALL  FACT2(TA1  ,TB  1  ,TERS) 

P3=WNT+1.0 

P4=P3-RI 

P5=RU-L0 

Z=(DGAMMA(P3))/((DGAMMA(P4))*(DGAMMA(P5))) 

P=TERS 

TOT=TOT+(P*Z*W) 

1000  CONTINUE 

1001  Gl=TOT 

*  SO  WE  COMPUTED  THE  VALUE  OF  THE  FIRST  CONSTRAINT 

IF  (Gl.GT.ALF)  GO  TO  333 
IF  (XT.EQ.NT)  GO  TO  380 
XT=XT+1.0 
GO  TO  301 
333  XT=XT-L0 

IF  (XT.LT.0.0)  GO  TO  999 

*  OTHERWISE  WE  GO  AND  CALCULATE  G2 
380  WW=W*(DEL**WNT) 

*  NOW  COMPUTE  THE  VALUE  WHEN  XT  IS  ZERO,  THAT  IS  J  IS  ZERO 
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*  WHEN  J  IS  ZERO,  L  IS  ZERO 

*  WHEN  J  IS  ZERO,  M  GOES  FROM  ZERO  TO  NT  AND  L  IS  ALWAYS  ZERO  IN  THIS 
CASE 

*  FIRST  CONSIDER  THE  CASE  WHEN  M  IS  ZERO 

A=W1 

B=W2 

TA1=WI 

TBI=W2 

CALL  FACT2(TA1  ,TB  1  ,TERS) 

Pl=  MDBETA(X1,A,B) 

P2=  MDBETA(X2,A,B) 

Y=TERS 

VALO=(P2-Pl)*Y 

SUM=VALO 

*  NOW  CONSIDER  THE  CASES  WHERE  M  IS  ONE  TO  NT 

DO  1500M=1,NT 
A=W1 
BM=M 

BM1=WNT+1.0 

BM2=WNT-BM+1.0 

BM3=BM+1.0 

BMC0M=DGAMMA(BM1)/((DGAMMA(BM2))*(DGAMMA(BM3))) 

BFAC=(DEL**(-BM))*BMCOM 

B=W2+BM 

TA1=W1 

TB1=B 

CALL  FACT2(TA1,TB1,TERS) 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

Y=TERS 

VAL=(P2-P1)*Y*BFAC 
SUM=SUM+VAL 
1500  CONTINUE 

JXT=XT 

RJSUM=SUM 

*  IF  XT  IS  ZERO  WE  HAVE  ONLY  THE  ABOVE  TERM 

IF  (XT.EQ.0.0)  GO  TO  2001 

DO  2000  J=1,JXT 

*  THIS  IS  THE  MOST  OUTER  SUM 

RJ=J 

RJI=WNT+L0 

RJ2=WNT-RJ+L0 

RJ3=RJ+1.0 

COMBJ=(DGAMMA(RJl))/((DGAMMA(RJ2))*(DGAMMA(RJ3))) 
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*  NOW  L  IS  FROM  ZERO  TO  J.  AGAIN  CONSIDER  THE  CASE  WHERE  L  IS  ZERO 

LP=(-1)**J 

PL=LP 

*  NOTE  WHEN  L  IS  ZERO,  M  GOES  FROM  ZERO  TO  NT-J 

LJL=NT-J 

IF  (LJL.EQ.0)  GO  TO  2101 

DO  2100  M=1,LJL 
RRM=M 

RRM1=WNT-RJ+1.0 
RRM2=WNT-RJ-RRM+ 1 .0 
RRM3=RRM+1.0 

RC0M=(DGAMMA(RRM1))/((DGAMMA(RRM2))*(DGAMMA(RRM3))) 

FFAC=(DEL**(-RRM))*RCOM 

A=SGM 

B=RRM+SDEL 

TA1=A 

TB1=B 

CALL  FACT2(TA1  ,TB  1  ,TERS) 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

Y=TERS 

VALM=(P2-P1)*FFAC*Y 

VALO=VALO+VALM 

2100  CONTINUE 

2101  RLSUM=VALO*PL 

*  THIS  IS  THE  VALUE  WHEN  L  IS  ZERO 

*  NOW  WE  WANT  TO  CONSIDER  L  FROM  1  TO  J.  THIS  IS  THE  SECOND  SUM 

D0  2500L=1,J 
RL=L 

RL1=RJ-RL+1.0 

RL2=RL+1.0 

C0MBL=(DGAMMA(RJ3))/((DGAMMA(RL1))*(DGAMMA(RL2))) 

LPL=(-1)**(J-L) 

FLP=LPL 

POWER=DEL**(-RL) 

FACL=FLP*COMBL*POWER 

*  NOW  CONSIDER  M  LOOP  AGAIN.  NOW  M  IS  FROM  ZERO  TO  NT-J  FOR  GIVEN  L 

*  START WITHM  =  0 

A=RL+SGM 

B=SDEL 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

TA1=A 

TB1=B 
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CALL  F  ACT2(TA  1  ,TB  1  ,TERS) 

Y=TERS 

VAL=(P2-P1)*Y 

RMSUM=VAL 

LL=NT-J 

IF  (LL.EQ.O)  GO  TO  3001 

D0  3000M=LLL 
RM=M 

RM1=WNT-RJ+1.0 
RM2=WNT-RJ-RM+ 1 .0 
RM3=RM+1.0 

COMBM=(DGAMMA(RMl))/((DGAMMA(RM2))*(DGAMMA(RM3))) 

FACM=(DEL**(-RM))*(COMBM) 

A=RL+SGM 

B=RM+SDEL 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

TA1=A 

TB1=B 

CALL  FACT2(TA1  ,TB  1  ,TERS) 

Y=TERS 

VAL=(P2-P1)*FACM*Y 

RMSUM=RMSUM+VAL 

3000  CONTINUE 

3001  RRSUM=RMSUM 

*  THE  MOST  INNER  LOOP  IS  HNISHED 

RLSUM=(FACL*RRSUM)+RLSUM 

*  THIS  IS  THE  SUM  FOR  L  LOOP 

2500  CONTINUE 
*L  LOOP  IS  FINISHED 

*  NOW  FINISH  J  LOOP,  THE  MOST  OUTER  LOOP 

RJSUM=(COMBJ*RLSUM)+RJSUM 

2000  CONTINUE 

*  SO  WE  EVALUATED  G2 

2001  G2=RJSUM*WW 

IF  (G2.LE.BET)  THEN 
IF  (XT.LE.L0)  GO  TO  999 
XT=XT-L0 
GO  TO  380 
END  IF 

777  IF  (XT.LT.L0)  GO  TO  888 
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XT=XT-1.0 

*  CHECK  G2  AGAIN 

WW=W*(DEL**WNT) 

*  NOW  COMPUTE  THE  VALUE  WHEN  XT  IS  ZERO,  THAT  IS  WHEN  J  IS  ZERO 

*  WHEN  J  IS  ZERO,  L  IS  ZERO 

*  WHEN  J  IS  ZERO,  M  GOES  FROM  ZERO  TO  NT  AND  L  IS  ALWAYS  ZERO  IN  THIS 
CASE 

*  FIRST  CONSIDER  THE  CASE  WHEN  M  IS  ZERO 

A=W1 

B=W2 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

TA1=A 

TB1=B 

CALL  FACT2(TA1  ,TB  1  ,TERS) 

Y=TERS 

VALO=(P2-Pl)*Y 

SUM=VALO 

*  NOW  CONSIDER  THE  CASES  WHERE  M  IS  ONE  TO  NT 

DO  1501  M=1,NT 
A=W1 
BM=M 

BM1=WNT+1.0 

BM2=WNT-BM+1.0 

BM3=BM+1.0 

BMCOM=DGAMMA(BMl)/((DGAMMA(BM2))*(DGAMMA(BM3))) 

BFAC=(DEL**(-BM))*BMCOM 

B=W2+BM 

TA1=A 

TB1=B 

CALL  FACT2(TA1,TB1,TERS) 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

Y=TERS 

VAL=(P2-P1)*Y*BFAC 
SUM=SUM+VAL 
1501  CONTINUE 

JXT=XT 

RJSUM=SUM 

*  IF  XT  IS  ZERO,  WE  HAVE  ONLY  THE  ABOVE  TERM 

IF  (XT.EQ.0.0)  GO  TO  201 1 

DO  5000  J=1,JXT 

*  THIS  IS  THE  MOST  OUTER  TERM 

RJ=J 
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RJ1=WNT+1.0 

RJ2=WNT-RJ+1.0 

RJ3=RJ+1.0 

COMBJ=(DGAMMA(RJl))/((DGAMMA(RJ2))*(DGAMMA(RJ3))) 

*  NOW  L  IS  FROM  ZERO  TO  J.  AGAIN  CONSIDER  THE  CASE  WHERE  L  IS  ZERO 

LP=(-1)**J 

PL=LP 

*  NOTE  WHEN  L  IS  ZERO  M  GOES  FROM  ZERO  TO  NT-J 

LJL=NT-J 

IF  (LJL.EQ.0)  GO  TO  2102 

DO  2105  M=1,LJL 
RRM=M 

RRM1=WNT-RJ+1.0 
RRM2= WNT-RJ-RRM+ 1 .0 
RRM3=RRM+1.0 

RC0M=(DGAMMA(RRM1))/((DGAMMA(RRM2))*(DGAMMA(RRM3))) 

FFAC=(DEL**(-RRM))*RCOM 

A=SGM 

B=RRM+SDEL 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

TA1=A 

TB1=B 

CALL  FACT2(TA1,TB1,TERS) 

Y=TERS 

VALM=(P2-P1)*FFAC*Y 
VALO=VALO+VALM 
2105  CONTINUE 

2102  RLSUM=VALO*PL 

*  THIS  IS  THE  VALUE  WHEN  L  IS  ZERO 

*  NOW  WE  WANT  TO  CONSIDER  L  FROM  1  TO  J.  THIS  IS  THE  SECOND  SUM 

DO  2501  L=1,J 
RL=L 

RL1=RJ-RL+1.0 

RL2=RL+1.0 

C0MBL=(DGAMMA(RJ3))/((DGAMMA(RL1))*(DGAMMA(RL2))) 

LPL=(-1)**(J-L) 

FLP=LPL 

POWER=DEL**(-RL) 

FACL=FLP*COMBL*POWER 

*  NOW  CONSIDER  M  LOOP  AGAIN.  NOW  M  IS  FROM  ZERO  TO  NT-J  FOR  GIVEN  L 

*  START  WITH  M  =  ZERO 

A=RL+SGM 

B=SDEL 
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P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A,B) 

TA1=A 

TB1=B 

CALL  F  ACT2(TA  1  ,TB  1  ,TERS) 

Y=TERS 

VAL=(P2-P1)*Y 

RMSUM=VAL 

LL=NT-J 

IF  (LL.EQ.O)  GO  TO  4001 

DO  4000  M=1,LL 
RM=M 

RM1=WNT-RJ+1.0 
RM2=WNT-RJ-RM+ 1 .0 
RM3=RM+1.0 

C0MBM=(DGAMMA(RM1))/((DGAMMA(RM2))*(DGAMMA(RM3))) 

FACM=(DEL**(-RM))*(COMBM) 

A=RL+SGM 

B=RM+SDEL 

P1=MDBETA(X1,A,B) 

P2=MDBETA(X2,A3) 

TA1=A 

TB1=B 

CALL  FACT2(TA1,TB1,TERS) 

Y=TERS 

VAL=(P2-P1)*FACM*Y 

RMSUM=RMSUM+VAL 

4000  CONTINUE 

4001  RRSUM=RMSUM 

*  THE  MOST  INNER  LOOP  IS  HNISHED 

RLSUM=(FACL*RRSUM)+RLSUM 

*  THIS  IS  THE  SUM  FOR  L  LOOP 

2501  CONTINUE 
*L  LOOP  IS  FINISHED 

*  NOW  FINISH  J  LOOP,  THE  MOST  OUTER  LOOP 

RJSUM=(COMBJ*RLSUM)+RJSUM 
5000  CONTINUE 

*  SO  WE  EVALUATED  G2 

2011  G2=RJSUM*WW 

*  CHECK  G2  NOW 

IF  (G2.GE.BET)  GO  TO  777 
XT=XT+1.0 
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GO  TO  888 
999  NT=NT+1 

IF  (NT.GE.75)  THEN 
SGM=SGM-.2 
GO  TO  13 
END  IF 
GOTO  11 
888  RETURN 
END 


SUBROUTINE  FACTl  (A1  ,B  1  ,SON) 
IMPLICIT  DOUBLE  PRECISION(A-H,0-Z) 
C=A1+B1 

IF  (A1.LE.57.0.AND.C.LE.57.0)  GO  TO  41 

C1=C-1.0 

A2=A1-L0 

B2=B1-L0 

C2=A2+B2 

IB=A2+L0 

IC=C2 

PAY=C1 

D0  42I=IB,IC 
ZI=I 

PAY=PAY*ZI 
42  CONTINUE 

PAYDA=1.0 

JA=B2 


D0  43  J=1,JA 
VJ=J 

PAYDA=PAYDA*VJ 
43  CONTINUE 

SON=PAY/PAYDA 
GO  TO  45 

41  SON=DGAMMA(C)/((DGAMMA(Al))*(DGAMMA(B  1))) 
45  CONTINUE 
RETURN 
END 


SUBROUTINE  FACT2(TA1  ,TB  1  ,TERS) 
IMPLICIT  DOUBLE  PRECISION(A-H,0-Z) 
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C=TA1+TB1 

IF  (TA1.LE.57.0.AND.C.LE.57.0)  GO  TO  71 

C1=C-1.0 

A2=TA1-1.0 

B2=TB  1-1.0 

C2=A2+B2 

IB=A2+1.0 

IC=C2 

PAY=C1 

DO  72  I=IB,IC 
ZI=I 

PAY=PAY*ZI 
72  CONTINUE 

PAYDA=1.0 

JA=B2 


DO  73  J=1,JA 
VJ=J 

PAYDA=PAYDA*VJ 
73  CONTINUE 

TERS=PAYDA/PAY 
GO  TO  75 

71  TERS=((DGAMMA(TA1))*(DGAMMA(TB1)))/(DGAMMA(C)) 
75  CONTINUE 
RETURN 
END 


DOUBLE  PRECISION  FUNCTION  LGAMMA(XX) 

INTEGER  J 

DOUBLE  PRECISION  COF(6),STP,HALF,ONE,FPF,X,XX,TMP,SER 
DATA  COF,STP/76.1 8009173DO,-86.50532033DO,24.01409822DO, 

+  -1.231739516D0,.120858003D-2,-.536382D-5,2.50662827465D0/ 

DATA  HALF, ONE, FPF/0.5D0, 1 .0D0,5 .5D0/ 

X=XX-ONE 

TMP=X+FPF 

TMP=(X+HALF)*DLOG(TMP)-TMP 
SER=ONE 
DO  131  J=l,6 

X=X+ONE 

SER=SER+COF(J)/X 
131  CONTINUE 

LGAMMA=TMP+DLOG(STP*SER) 

RETURN 

END 
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DOUBLE  PRECISION  FUNCTION  DGAMMA(XX) 

INTEGER  J 

DOUBLE  PRECISION  COF(6),STP,HALF,ONE,FPF,X,XX,TMP,SER,LNGAMMA 
DATA  COF,STP/76.1 8009173D0,-86.50532033D0,24.01409822D0, 

+  -1.23 17395 16D0,.  1 20858003D-2,-.536382D-5,2.50662827465D0/ 

DATA  HALF, ONE,FPF/0.5D0,L0D0,5.5D0/ 

X=XX-ONE 

TMP=X+FPF 

TMP=(X+HALF)*DLOG(TMP)-TMP 
SER=ONE 
DO  141  J=l,6 
X=X+ONE 
SER=SER+COF(J)/X 
141  CONTINUE 

LNGAMMA=TMP+DLOG(STP*SER) 

DGAMMA=DEXP(LNGAMMA) 

RETURN 

END 


DOUBLE  PRECISION  FUNCTION  MDBETA(X1  l.Vl.Wl) 

DOUBLE  PRECISION  BT,BT1,BT2,LGAMMA,V1,W1,BETACF,X11 
IF(X1LEQ.O..OR.X1LEQ.L)  THEN 
BT=0.0 
ELSE 

BT1=LGAMMA(V1+W1)-LGAMMA(V1)-LGAMMA(W1) 
BT2=V1*DL0G(X1 1)+W1*DLOG(LO-X1 1) 

BT=DEXP(BT1+BT2) 

END  IF 

IF(Xll.LT.(Vl+L)/(Vl+Wl+2.))  THEN 
MDBETA=BT*BETACF(V1,W1,X1 1)A^1 
RETURN 
ELSE 

MDBETA=1.-BT*BETACF(W1,V1  ,L-X1 1)/W1 
RETURN 
END  IF 
END 


DOUBLE  PRECISION  FUNCTION  BETACF(S2,T2,X21) 

INTEGER  M 

DOUBLE  PREQSION  AM,BM,AZ,QAB,QAP,QAM,EM,TEM,S2,T2,X21 
DOUBLE  PRECISION  D,AP,BP,APP,BPP,AOLD,BZ 
PARAMETER(ITMAX=  100,EPS=3.E-7) 

AM=1. 
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BM=1. 

AZ=1. 

QAB=S2+T2 

QAP=S2+1. 

QAM=S2-1. 

BZ=1.-QAB*X21/QAP 

DO  91  M=1,ITMAX 
EM=M 

TEM=EM+EM 

D=EM*(T2-M)*X21/((QAM+TEM)*(S2+TEM)) 

AP=AZ+D*AM 

BP=BZ+D*BM 

D=-(S2+EM)*(QAB+EM)*X21/((S2+TEM)*(QAP+TEM)) 

APP^AP+D*AZ 

BPP=BP+D*BZ 

AOLD=AZ 

AM=AP/BPP 

BM=BP/BPP 

AZ=APP/BPP 

BZ=1. 

IF(DABS(AZ-AOLD).LT.EPS*DABS(AZ))  GO  TO  95 
91  CONTINUE 
95  BETACF=AZ 
RETURN 
END 
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Appendix  B.  Simulation  Results 


The  results  of  the  simulations  are  presented  in  Excel  spreadsheets.  Each  spreadsheet  contains 
the  simulations  that  were  run  for  a  particular  sampling  plan  over  one  degradation  function. 

The  spreadsheet  contains  information  on  the  type  of  sampling  plan  mn  and  which  degradation 
plan  was  used  at  the  top  of  the  spreadsheet.  Also  a  I  or  11  after  the  plan  indicates  if  it  was  a 
Condition  I  or  a  Condition  II  simulation.  The  very  left  column  contains  the  name  of  the  tests  that 
were  performed  on  the  tests.  The  second  and  third  columns  give  information  on  the  starting  means 
and  standard  deviations  of  the  tests  and  the  minimum  values  required  for  the  tests  and  the  Ui 
values. 

The  rows  of  the  tests  give  the  current  means  of  the  suits,  the  number  of  suits  accepted  out  of 
1000  samples,  and  the  average  sample  size  of  the  tests.  Note  that  in  the  Bayesian  plans,  the 
numbers  that  are  accepted  and  rejected  are  given  since  the  samples  are  dependent  on  what 
happened  the  year  before,  unlike  the  other  sample  plans.  Also  in  the  AF  sampling  plan  I,  the 
results  of  the  Pre-Posturing  results  are  given  immediately  under  the  results  of  the  original  sampling 
plan.  Also  note  in  the  truncated  sequential  results,  the  average  sample  size  is  given  only  once  since 
aU  tests  used  the  same  number  of  samples. 
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_  Sequential  Sample  Plan  I,  Weibull  (.065,15) 
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Appendix  C.  Original  Sampling  Plan  with  Attribute  Data 


In  this  appendix,  we  show  the  results  of  simulating  the  original  sampling  plan  with  attribute 
data,  rather  than  variable  data,  so  we  could  compare  the  results  to  the  Bayesian  methodology 
which  also  uses  attribute  data.  If  a  test  result  was  greater  than  the  minimum  requirement,  it  was 
considered  a  pass,  and  if  a  test  result  was  less  than  the  minimum  requirement,  it  was  considered  a 
fail,  just  as  in  the  Bayesian  methodology.  We  ran  the  same  tests  as  in  the  Bayesian  methodology 
and  we  came  up  with  the  following  results.  The  original  sampling  plan  results,  which  are  listed  as 
binomial,  are  given  along  with  the  Bayesian  results  so  they  can  be  compared. 


Table  18.  Bayesian  Plan  and  Original  Sampling  Plan  with  Attribute  Data 
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Early 

2 

Years 

Early 

1 

Year 

Early 

Year 

Failed 

1 

Year 

Late 

2 

Years 

Late 

3 

Years 

Late 

4 

Years 

Late 

>4 

Years 

Late 

Avg  #  of 
Samples 

DEGRADATION  FUNCTION  1 

BINOMIAL 

4.811 

1.51 

1.465 

1.645 

2.263 

6.217 

27.59 

43.86 

10.64 

0 

0 

360 

BAYESIAN 

2.86 

1 

1.29 

1.14 

1.57 

2.43 

18.71 

26.14 

43.57 

1.29 

0 

321 .47 

DEGRADATION  FUNCTION  2 

BINOMIAL 

0 

0 

1.777 

1.85 

2.543 

4.644 

12.74 

23.63 

20.35 

27.48 

4.987 

360 

BAYESIAN 

0 

0 

1.71 

0 

2 

1.43 

6.43 

16.71 

17.29 

22 

30.43 

233.14 

DEGRADATION  FUNCTION  3 

BINOMIAL 

0 

0 

0 

0 

0 

57.16 

10.36 

8.182 

9.513 

6.437 

8.35 

360 

BAYESIAN 

0 

d 

0 

0 

0 

30.43 

0 

15.29 

4.29 

3.86 

37.14" 

151.26 

DEGRADATION  F 

UNCTION  4 

BINOMIAL 

0 

0 

0 

0 

0 

90.02 

5.95 

3.228 

0.75 

0.055 

0 

360 

BAYESIAN 

0 

0 

0 

0 

0 

53.14 

10.29 

9.86 

8.14 

4 

14.57 

125 

DEGRADATION  F 

UNCTION  5 

BINOMIAL 

0 

0 

0 

0 

0 

99.63 

0.366 

0 

0 

0 

0 

360 

BAYESIAN 

0 

0 

0 

0 

0 

83.57 

5.43 

2.43 

1.14 

1.57 

5.86 

176.61 

DEGRADATION  F 

UNCTION  6 

BINOMIAL 

0 

d 

0 

0 

0 

11.22 

16.95 

16.9 

11.35 

7.782 

35.8 

360 

BAYESIAN 

0 

d 

0 

0 

0 

3 

5.57 

15 

12.57 

6.43 

54.71 

185.22 
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Note  that  the  Bayesian  plan  still  does  not  perfonn  as  well  even  when  compared  to  another 
methodology  using  attribute  data.  The  Bayesian  plan  rejects  less  suits  in  the  Year  Failed  column 
than  does  the  original  sampling  plan  when  it  also  uses  attribute  data.  See  section  4.6  for  an 
explanation  as  to  why  the  Bayesian  plan  does  not  do  so  well. 
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